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TTGGGCAAGG AATTTCAGTG ACACAAACAC 
GCTATTGCTA ATGATGGAGT TATGCTGGAG 
TTATGATACT AACAATCAGT CTGTACGTAA 
GAAATCCTGT TTCCAAAGAG GCAGCAAGCA 
5 TTAGTTGGGA CGGACCCTCT ATATGGAACT 
AAAGCCAATT ATAACAGTTC CTGGACAAAA 
CGGCTCAAAT CGCTGATGAG AAAAATGGAG 
AATTATATTT TCTCAGTTGT GACTATGAAT 
TATCTTGTAT GTAACGGTTC AACAGCCTGA 

10 TGGGAGAATT TGCCACCCCA ATCTTGGAGC 
TCTCTCAATC TTCAATCTCC AGCCAAAAAT 
ATCTTCTTAT GCAATGCCTA GCATCAAGGA 
CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC 
ACAAAGATTA AAGAGACTTC TGTAGAAGAA 

15 CCAACAAGTT CTCCTTTTAT CGGATAAGGT 
ATGGCTGGAA AAAAGAGACT GCCGAGACCT 
GAACTGGAAT TTGAAGGTTC AGGTTCCGTT 



PCT/CAOO/01150 

AAATGCTTCG TGCCTTTACA 900 

CCAAAATTTA TAAGTGCTAT 950 

GTCACAAAAA GAAATAGTAG 1000 

CAACTCGAAA TCACATGATC 1050 

ATGTATAATC ACTACACAGG 1100 

TGTAGCAGTT AAATCCGGTA 1150 

GATACTTGGT TGGTTCTACC 12 00 

CCTGCTGAAA ATCCTGATTT 12 50 

GCATTATTCA GGTATCCAGT 13 00 

GGGCTTCAGC TATGAAAGAA 1350 

TTAGATAAAG TTACGACAGA 1400 

TATTTCACCT GGTGAGTTGG 14 50 

CAATCGTTGT AGGTACTGGA 1500 

GGGACCAATC TTGCACCAAA 1550 

AGAAGAAATT CCAGACATGT 1600 

TTGCTAAATG GTTGGATATT 1650 

GTTCAGAAG 1689 



20 



2) INFORMATION FOR SEQ ID NO: 1047 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047 

CAAAAAAGAA TTGGAAGCTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAACCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TGCTGGGAAC 150 

40 TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA 200 

TTATTACCTA TGAAAAGGAT CGTCTGGGTA ATATTGTACC CGGAACAGAA 2 50 

CAAGTTTCCC AACAAACGGT AGATGGCT^G GATGTTTACA CAACCATTTC 3 00 

CAGCCCCCTC CAGTCCTTTA TGGAAACCCA GATGGATGCT TTTCAAGAGA 350 

AGGTAAAAGG AAAGTACACG ACAGCGACTT TGGTCAGTGC TAAAACAGGG 400 

45 GAAATTCTGG CAACAACGCA ACGACCGACC TTTGATGCAG ATACAAAAGA 4 50 

AGGCATTACA GAGGACTTTG TTTGGCGTGA TATCCTTTAC CAAAGTAACT 500 

ATGAGCCAGG TTCCCCTATG AAAGTGATGA TGTTGGCTGC TGCTATTGAT 550 

AATAATACCT TTCCAGGGGG AGAAGTCTTC AATAGTAGTG AGTTAAAAAT 600 

TGCAGATGTC ACGATTCGAG ATTGGGACGT CAATGAAGGA TTGACTGGTG 650 

50 GCAGAATGAT GACCTTTTCT CAAGGGTTCG CTCACTCAAG TAACGTTGGG 700 

ATGACGCTTC TTGAGCAAAA GATGGGAGAT GCTACATGGC TTGATTATCT 750 

AAATCGCTTT AAATTTGGTG TTCCGACTCG TTTTGGCTTG ACGGATGAAT 800 

ATGCAGGCCA ACTTCCAGCT GACAATATCG TAAATATTGC TCAGAGTTCA 850 

TTTGGACAAG GGATTTCAGC GACCCAGACG CAAATGATTC GTGCCTTCAC 900 

55 GGCTATTGCC AACGATGGAG TCATGTTAGA ACCTAAATTT ATCAGTGCCA 950 

TTTATGATCC AAATGATCAA ACTGCTCGGA AATCACAA7A AGAAGTTGTG 1000 

GGAAATCCTG TGTCTAAAGA TGCAGCGAGC TTGACGCGAA CGCATATGGT 1050 

TTTAGTCGGT ACCGATCCAG TATATGGAAC TATGTATAAT CATAAGACAG 1100 

GGAAACCAAC TGTAACTGTT CCTGGGCAAA ATGTAGCCCT CAAGTCTGGT 1150 

60 ACGGCTCAGA TTGCCGATGA GAAAAATGGA GGTTACTTAG TTGGTACGAC 1200 

591 
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CAATTACATT TTTTCGGCTG 
TTATTCTCTA TGTGACGGTT 
TTGGGGGAAT TTGCCAATCC 
TTCCCTTAAC CTCCAATCTA 
5 AAAGCGCTTA TGCCATGCCT 
GCGGAAGCCT TACGTCGCAA 
AACAAAAATT AAAGAATCAT 
ACCAGCAAGT TCTTCTCTTA 
TATGGTTGGA CAAAAGAGAC 
10 AGAACTTGAA TTTGAAGGTT 



TATCGATGAA CCCTGCTGAA 
CAACAGCCTG AGCATTATTC 
TATCTTGGAA AGGGCAGTGG 
CCGCTAAAAC GTTAAATCAG 
AGCATCAAGG ACATTTCACC 
TATTGTGCAA CCAATCGTTG 
CTGTAGAAGA AGGGACGGAT 
TCTGATAAAG CAGAGGAAGT 
TGCTGAGACC TTTGCTAAGT 
CGGGCTCTAC TGTGCAGAAG 



PCT/CAOO/01150 

AATCCTGATT 1250 

AGGTATTCAG 1300 

CTATGAAAGA 1350 

GTAACCAATC 140 0 

TGGCGATTTG 1450 

TAGGAACAGG 1500 

CTTGCACCTA 1550 

TCCAGATATG 1600 

GGCTCAATAT 1650 
1690 



2 ) INFORMATION FOR SEQ ID NO: 1048 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048 



30 AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTACCCAAAC GGACAATTTG CTTCTAGCTT TATTGGCTTA 100 

GCCCAACTTC ATGAAAATGA GGATGGTAGT AAGAGTTTGT TAGGGACTTC 150 

TGGTTTGGAG AGTTCTTTAA ATACCATTCT TGCTGGGACA GACGGTATTA 200 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 250 

35 GTATCGCT^C AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 3 00 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 350 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

40 AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAAGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 7 00 

AGTCTACTTG AACAT^AAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 750 

45 ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 100 0 

50 AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1200 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

55 TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1300 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 14 00 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 1450 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1500 

60 AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 1550 

592 
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PCT/CAOO/01150 



AACAAGTTCT CCTTTTATCG GATAAGGTAG AAGAAATTCC AGACATGTAT 
GGCTGGAAAA AAGAGACTGC TGAAACCTTT GCTAAATGGT TGGATATTGA 
GTTGGAATTT GAAGGTTCAG GTTCCGTCGT TC 



1600 
1650 
1682 



2) INFORMATION FOR SEQ ID NO: 1049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049 

GCAGCGTTGC GTGATACCGT TGAAAAAACC ATTAAAAACT GTTTGGATTT 50 

TGAAAGGAGA CAGGAGCATG AATAGAATAA AAGTTGCAAT ACTGTTTGGG 100 

GGTTGCTCAG AGGAGCATGA CGTATCGGTA AAATCTGCAA TAGAGATAGC 150 

CGCTAACATT AATAAAGAAA AATACGAGCC GTTATACATT GGAATTACGA 200 

AATCTGGTGT ATGGAAAATG TGCGAAAAAC CTTGCGCGGA ATGGGAAAAC 250 

GACAATTGCT ATTCAGCTGT ACTCTCGCCG GATAAAAAAA TGCACGGATT 3 00 

ACTTGTTAAA AAGAACCATG AATATGAAAT CAACCATGTT GATGTAGCAT 350 

TTTCAGCTTT GCATGGCAAG TCAGGTGAAG ATGGATCCAT ACAAGGTCTG 4 00 

TTTGAATTGT CCGGTATCCC TTTTGTAGGC TGCGATATTC AAAGCTCAGC 450 

AATTTGTATG GACAAATCGT TGACATACAT CGTTGCGAAA AATGCTGGGA 500 

TAGCTACTCC CGCCTTTTGG GTTATTAATA AAGATGATAG GCCGGTGGCA 550 

GCTACGTTTA CCTATCCTGT TTTTGTTAAG CCGGCGCGTT CAGGCTCATC 600 

CTTCGGTGTG AAAAAAGTCA ATAGCGCGGA CGAATTGGAC TACGCAATTG 650 

AATCGGCAAG ACAATATGAC AGCAAAATCT TAATTGAGCA GGCTGTTTCG 700 

GGCTGTGAGG TCGGTTGTGC GGTATTGGGA AACAGTGCCG CGTTAGCTGT 750 

TGGCGAGGTG GACCAAATCA GGCTGCAGTA CGGAATCTTT CGTATTCATC 800 

AGGAAGTCGA GCCGGAAAAA GGCTCTGAAA ACGCAGTTAT AACCGTTCCC 850 

GCAGACCTTT CAGCAGAGGA GCGAGGACGG ATACAGGAAA CGGCAAAAAA 900 

AATATATAAA GCGCTCGGCT GTAGAGGTCT AGCCCGTGTG GATATGTTTT 950 

TACAAGATAA CGGCCGCATT GTACTGAACG AAGTCAATAC TCTGCCCGGT 1000 

TTCACGTCAT ACAGTCGTTA TCCCCGTATG ATGGCCGCTG CAGGTATTGC 1050 

ACTTCCCGAA CTGATTGACC GCTTGATCGT ATTAGCGTTA AAGGGGTGAT 1100 

AAGCATGGAA ATAGGATTTA CTTTTTTAGA TGAAATAGTA CACGGTGTTC 1150 

GTTGGGACGC TAAATATGCC ACTTGGGATA ATTTCACCGG AAAACCGGTT 12 00 

GACGGTTATG AAGTAAATCG CATTGTAGGG ACATACGAAT T 1241 



2) INFORMATION FOR SEQ ID NO: 1050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



593 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus galllnaLrum 

(B) STRAIN: R691 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1050 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 10 0 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

10 GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAT^ 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

15 CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

20 TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAT^AAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

25 CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

30 TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 0 0 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCG 1249 



35 2 ) INFORMATION FOR SEQ ID NO: 1051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 72 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ent:Grococcus faBClum 

(B) STRAIN: R481 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051 



TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 50 

TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 100 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 150 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 200 

55 GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 250 

TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 300 

GCACGGATTA CTTGTTAAAA AGT^CCATGA ATATG7VAATC AACCATGTTG 350 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 400 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 450 

60 AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 500 
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ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 
CCGGTGGCAG CTACGTTTAC CTATCCTGTT 
AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 
ACGCAATTGA ATCGGCAAGA CAATATGACA 
5. GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 
GTTAGTTGTT GGCGAGGTGG ACCAAATCAG 
GTATTCATCA GGAAGTCGAG CCGGAAAAAG 
ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 
GGCAAAAAAA ATATATAAAG CGCTCGGCTG 

10 ATATGTTTTT ACAAGATAAC GGCCGCATTG 
CTGCCCGGTT TCACGTCATA CAGTCGTTAT 
AGGTATTGCA CTTCCCGAAC TGATTGACCG 
AGGGGTGATA AGCATGGAAA TAGGATTTAC 
ACGGTGTTCG TTGGGACGCT AAATATGCCA 

15 AAACCGGTTG ACGGTTATGA AGTAAATCGC 
GGCTTGAATC GCTTTTTGAA GG 



PCT/CAOO/01150 

TTATTAATAA AGATGATAGG 550 

TTTGTTAAGC CGGCGCGTTC 600 

TAGCGCGGAC GAATTGGACT 650 

GCAAAATCTT AATTGAGCAG 700 

GTATTGGGAA ACAGTGCCGC 750 

GCTGCAGTAC GGAATCTTTC 800 

GCTCTGAAAA CGCAGTTATA 85 0 

CGAGGACGGA TACAGGAAAC 900 

TAGAGGTCTA GCCCGTGTGG 950 

TACTGAACGA AGTCAATACT 1000 

CCCCGTATGA TGGCCGCTGC 1050 

CTTGATCGTA TTAGCGTTAA 1100 

TTTTTTAGAT GAAATAGTAC 1150 

CTTGGGATAA TTTCACCGGA 1200 

ATTGTAGGGA CATACGAATT 12 50 

1272 



20 2) INFORMATION FOR SEQ ID NO: 1052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 7 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGT^ISM: Enterococcus fetecium 

(B) STRAIN: R4 92 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052 



TCCCCCGGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 2 00 

4 0 ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 3 50 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 4 00 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

45 TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

50 GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 8 00 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

55 TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 00 

60 CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTA 1237 
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2) INFORMATION FOR SEQ ID NO: 1053 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 63 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecixim 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053 

20 CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG TTGAAAAAAC 50 

CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT GAATAGAATA 10 0 

AT^GTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG ACGTATCGGT 150 

AA/y^TCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA AAATACGAGC 2 00 

CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 2 50 

25 CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC 300 

GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA 350 

TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA GTCAGGTGAA 400 

GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC CTTTTGTAGG 4 50 

CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG TTGACATACA 500 

30 TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 550 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA 600 

GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG 65 0 

ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA CAGCAAAATC 70 0 

TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG CGGTATTGGG 750 

35 AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC AGGCTGCAGT 800 

ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 850 

AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG 90 0 

GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC 95 0 

TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT TGTACTGAAC 1000 

40 GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT ATCCCCGTAT 1050 

GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC CGCTTGATCG 110 0 

TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 1150 

ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC CACTTGGGAT 1200 

AATTTCACCG GAAAACCGGT TGACGGGTAT GAAAGTAT^T CGCATTGTAG 12 50 

45 GGACATTCGA ATT 1263 



50 



2) INFORMATION FOR SEQ ID NO: 1054 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus faecalis 
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(B) STRAIN: R610 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054 



10 



15 



20 



25 



30 



CGGCCTATTA 
AAAAACTGTT 
TTGCAATACT 
TCTGCAATAG 
ATACATTGGA 
GCGCGGAATG 
AAAAAAATGC 
CCATGTTGAT 
GATCCATACA 
GATATTCAAA 
TGCGAAAAAT 
ATGATAGGCC 
GCGCGTTCAG 
ATTGGACTAC 
TTGAGCAGGC 
AGTGCCGCGT 
AATCTTTCGT 
CAGTTATAAC 
CAGGAAACGG 
CCGTGTGGAT 
TCAATACTCT 
GCCGCTGCAG 
AGCGTTAAAG 
AATAGTACAC 
TCACCGGAAA 



TNCCGAGCAA 
TGGATTTTGA 
GTTTGGGGGT 
AGATAGCCGC 
ATTACGAAAT 
GGAAAACGAC 
ACGGATTACT 
GTAGCATTTT 
AGGTCTGTTT 
GCTCAGCAAT 
GCTGGGATAG 
GGTGGCAGCT 
GCTCATCCTT 
GCAATTGAAT 
TGTTTCGGGC 
TAGTTGTTGG 
ATTCATCAGG 
CGTTCCCGCA 
CT^AAAA/y^T 
ATGTTTTTAC 
GCCCGGTTTC 
GTATTGCACT 
GGGTGATAAG 
GGTGTTCGTT 
ACCGGTTGAC 



GCGTTGCGTG 
AAGGAGACAG 
TGCTCAGAGG 
TAACATTAAT 
CTGGTGTATG 
AATTGCTATT 
TGTTAAAAAG 
CAGCTTTGCA 
GAATTGTCCG 
TTGTATGGAC 
CTACTCCCGC 
ACGTTTACCT 
CGGTGTGA?^ 
CGGCAAGACA 
TGTGAGGTCG 
CGAGGTGGAC 
AAGTCGAGCC 
GACCTTTCAG 
ATATAT^GCG 
AAGATAACGG 
ACGTCATACA 
TCCCGAACTG 
CATGGAAATA 
GGGACGCTAA 
GGTTATAAGT 



ATACCGTTGA 
GAGCATGAAT 
AGCATGACGT 
AAAGAAAAAT 
GAAAATGTGC 
CAGCTGTACT 
AACCATGAAT 
TGGCAAGTCA 
GTATCCCTTT 
AAATCGTTGA 
CTTTTGGGTT 
ATCCTGTTTT 
AAAGTCAATA 
ATATGACAGC 
GTTGTGCGGT 
CAAATCAGGC 
GGAAAAAGGC 
CAGAGGAGCG 
CTCGGCTGTA 
CCGCATTGTA 
GTCGTTATCC 
ATTGACCGCT 
GGATTTACTT 
ATATGCCACT 
AA 



AAAAACCATT 50 

AGAATAAAAG 100 

ATCGGTAAAA 150 

ACGAGCCGTT 200 

GAAAAACCTT 250 

CTCGCCGGAT 300 

ATGAAATCAA 350 

GGTGAAGATG 400 

TGTAGGCTGC 450 

CATACATCGT 500 

ATTAATAAAG 550 

TGTTAAGCCG 600 

GCGCGGACGA 650 

AAAATCTTAA 700 

ATTGGGAAAC 750 

TGCAGTACGG 800 

TCTGAAAACG 850 

AGGACGGATA 900 

GAGGTCTAGC 950 

CTGAACGAAG 1000 

CCGTATGATG 1050 

TGATCGTATT 1100 

TTTTAGATGA 1150 

TGGGATAATT 12 00 
1232 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1055 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gsLllinarum 

(B) STRAIN: R684 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055 



TACCGAGCAA 
50 TGGATTTTGA 
GTTTGGGGGT 
AGATAGCCGC 
ATTACGAAAT 
GGAAAACGAC 
55 ACGGATTACT 
GTAGCATTTT 
AGGTCTGTTT 
GCTCAGCAAT 
GCTGGGATAG 
60 GGTGGCAGCT 



GCGTTGCGTG 
AAGGAGACAG 
TGCTCAGAGG 
TAACATTAAT 
CTGGTGTATG 
AATTGCTATT 
TGTTAAAAAG 
CAGCTTTGCA 
GAATTGTCCG 
TTGTATGGAC 
CTACTCCCGC 
ACGTTTACCT 



ATACCGTTGA 
GAGCATGAAT 
AGCATGACGT 
AAAGAAAAAT 
GAAAATGTGC 
CAGCTGTACT 
AACCATGAAT 
TGGCAAGTCA 
GTATCCCTTT 
AAATCGTTGA 
CTTTTGGGTT 
ATCCTGTTTT 



AAAAACCATT 
AGAATAAAAG 
ATCGGTAAAA 
ACGAGCCGTT 
GAAAAACCTT 
CTCGCCGGAT 
ATGAAATCAA 
GGTGAAGATG 
TGTAGGCTGC 
CATACATCGT 
ATTAATAAAG 
TGTTAAGCCG 



AAAAACTGTT 50 

TTGCAATACT 100 

TCTGCAATAG 150 

ATACATTGGA 200 

GCGCGGAATG 250 

AAAAAAATGC 300 

CCATGTTGAT 350 

GATCCATACA 400 

GATATTCAAA 4 50 

TGCGAAT^AAT 500 

ATGATAGGCC 550 

GCGCGTTCAG 600 



597 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 PCT/CAOO/01 150 

GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGACTAC 650 

GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA TTGAGCAGGC 700 

TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC AGTGCCGCGT 750 

TAGCTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG AATCTTTCGT 800 

5 ATTCATCAGG AAGTCGAGCC GGAT^AAAGGC TCTGAAAACG CAGTTATAAC 850 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG 900 

CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT 950 

ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG TCT^TACTCT 1000 

GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG GCCGCTGCAG 1050 

10 GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT AGCGTTAAAG 1100 

GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 1150 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA 1200 

ACCGGTTGAC GGTTAGAA 1218 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 1056 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 12 65 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecixmi 

(B) STRAIN: R688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1056 



AATCACACCG CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG 50 

TTGAAAAAAC CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT 100 

35 GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG 150 

ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 200 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT 250 

GTGCGAAAAA CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG 3 00 

TACTCTCGCC GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT 350 

40 GAATATGAAA TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA 400 

GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC 4 50 

CTTTTGTAGG CTGCGATATT CAT^GCTCAG CAATTTGTAT GGACAAATCG 500 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG 550 

GGTTATTAAT AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG 600 

45 TTTTTGTTAA GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC 650 

AATAGCGCGG ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA 700 

CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG 750 

CGGTATTGGG AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC 800 

AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA 850 

50 AGGCTCTGAA AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG 900 

AGCGAGGACG GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC 950 

TGTAGAGGTC TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT 1000 

TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT 1050 

ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC 1100 

55 CGCTTGATCG TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT 1150 

ACTTTTTTAG ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC 1200 

CACTTGGGAT AATTTCACCG GAAAACCGGT TGACGGTTAT GAAGTAAATC 1250 

GCATTGTAGG GACAT 1265 



598 
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PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1057 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EntGrococcus flavescens 

(B) STRAIN: R689 



15 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 



TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

20 ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 2 00 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAT^^T GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

25 ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

30 TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

35 GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 100 0 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

40 CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACGGGTTG ACGGTTATGA AGTAAATCGC 1250 

ATTGTAGGGA CATACGAAT 1269 



2) INFORMATION FOR SEQ ID NO: 1058 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1169 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallin^rum 

(B) STRAIN: R420 



599 
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wo 01/23604 PCT/CAOO/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058 

CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

5 ATTCTCCAGA ATACTCAGTG TCACTAACCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTAACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 3 00 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 3 50 

10 CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 50 0 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

15 GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

TTGACGATTG GTGCTTGTGA TGCGATTTCT CTTGTCGACG GTTTTTTTGA 800 

TTTTGAAGAG AAATACCAAT TAATCAGCGC CACGATCACT GTCCCAGCAC 850 

20 CATTGCCTCT CGCGCTTGAA TCACAGATCA AGGAGCAGGC ACAGCTGCTT 900 

TATCGAAACT TGGGATTGAC GGGTCTGGCT CGAATCGATT TTTTCGTCAC 950 

CAATCAAGGA GCGATTTATT TAAACGAAAT CAACACCATG CCGGGATTTA 1000 

CTGGGCACTC CCGCTACCCA GCTATGATGG CGGAAGTCGG GTTATCCTAC 1050 

GAAATATTAG TAGAGCAATT GATTGCACTG GCAGAGGAGG ACAAACGATG 1100 

25 AACACATTAC AATTGATCAA TAAAAACCAT CCATTGAAAA AAAATCAAGA 1150 

GCCCCCGCAC TTAGTGCTA 1169 



30 2) INFORMATION FOR SEQ ID NO: 1059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

ATTCTCCAGA ATACTCAGTG TCACTAGCCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTGACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 2 00 

50 AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 2 50 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 3 00 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 4 50 

55 ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

60 AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

600 
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wo 01/23604 

TTGACGATTG GTGCTTGTGA TGCGATTTCT 
TTTTGAAGAG AAATACCAAT TAATCAGCGC 
CATTGCCTCT CGCGCTTGAA TCACAGATCA 
TATCGAAACT TGGGATTGAC GGGTCTGGCT 
CAATCAAGGA GCGATTTATT TAAACGAAAT 
CTGGGCACTC CCGCTACCCA GCTATGATGG 
GAAATATTAG TAGAGCAATT GATTGCACTG 
AACACATTAC AATTGATCAA TAAAAACCAT 
GCCCCCGCAC TTAGTG 



PCT/CAOO/01150 

CTTGTCGACG GTTTTTTTGA 800 

CACGATCACT GTCCCAGCAC 850 

AGGAGCAGGC ACAGCTGCTT 900 

CG7ATCGATT TTTTCGTCAC 950 

CAACACCATG CCGGGATTTA 1000 

CGGAAGTCGG GTTATCCTAC 1050 

GCAGAGGAGG ACAAACGATG 1100 

CCATTGAAAA AAT^TCAAGA 1150 

1166 



2) INFORMATION FOR SEQ ID NO: 1060 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060 

AACATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAATACACC 50 

GTTTCTTTAG CTTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CTATGACTAC GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTACTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 200 

TTGGATACGA AACATAAACA GAAAATACAG CCGCTATTCG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AGCAAACGTT GGTACCTGAT GTTTTATTTC 3 00 

CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AGCTGCCTTA TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC TGCAGCAGCC ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGAACAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTC CCAGTTTTCT TTAAGCCTAA 550 

TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCCAT TTCATTAGTA GACGGCTTTT 75 0 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAAAT CACCGTCCCT 800 

GCGCCATTGC CTGAAACGAT TGAT^CCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTGGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGAGCG AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGG TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTT 1028 



2) INFORMATION FOR SEQ ID NO: 1061 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

601 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavxis 
5 (B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061 

AATATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAGTACGCC 50 

10 GTTTCTTTAG CCTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CGATGACTAT GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTATTTGTA TACAGGAGAA CTGGAAAACA TCCGACJ\AGA CACGTGGTTG 2 00 

TTGGATACGA AACATAAACA GAAAATCCAG CCGCTTTTTG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AACAAACGTT GGTTCCTGAT GTTTTATTTC 300 

15 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AACTACCTTA TGTAGGCTGC GGGGTGGCAG CCTCTGCCTT 4 00 

ATGTATGAAC AAATGGCTGC TGCATCAAGC AGCAGAAGCG ATTGGCGTAC 45 0 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGATCAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTT CCGGTTTTTT TTAAGCCTAA 550 

20 TGAAGCGGGC TCCTCAAAAG GGATCACTT^A AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCAGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCTAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAGAT CACCGTTCCT 800 

25 GCACCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTTGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 90 0 

TCACGGATCA AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGA TCGGCTTATC 100 0 

CTATCAAGAA CTACTACAAA AACTGCTTGT 103 0 

30 



2) INFORMATION FOR SEQ ID NO: 1062 

35 (i) SEQUENCE CHT^RACTERISTICS : 

(A) LENGTH: 1031 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enteracoccus csLSseliflavxis 
45 (B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 

AAACATGAAA AA7!U\TCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

50 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 15 0 

TTGGTACTTG TATACAGGAG AACTGGAAT^^ CATCCGACAA GACACGTGGT 200 

TGTTGGATAC GAAACATAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGC TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

55 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTGCCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 4 00 

TTATGTATGA ACAAATGGCT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCAAGCCAAC CAGCAAGAAC 50 0 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TCCCAGTTTT CTTTAAGCCT 550 

60 AATGAAGCGG GCTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

602 
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PCT/CAOO/01150 



AATCGCTTCT GCCTTAAAAG 
TACAAAAAAA TATTGCCGGT 
GACTCTTTGA CTGTCGGTGC 
TTTCGATTTT GAAGAAAAGT 
CTGCGCCATT GCCTGAAACG 
CTGCTCTATC GTAGTCTTGG 
TGTCACGGAG CGAGGAGAAC 
GCTTTACGAG TCACTCCCGC 
TCCTATCAAG AACTACTACA 



AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 
GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 
TTGTGACGCC ATTTCATTAG TAGACGGCTT 
ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 
ATTGAAACCA AGGTCAAAGA ACAAGCTCAG 
TCTTAAAGGT CTTGCTCGCA TCGACTTTTT 
TATACTTGAA TGAAATCAAT ACTATGCCGG 
TATCCTGCCA TGATGGCAGC GGTCGGCTTA 
AAAACTGCTT G 



650 
700 
750 
800 
850 
900 
950 
1000 
1031 



2) INFORMATION FOR SEQ ID NO: 1063 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casselifl&vus 

(B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063 

AACATGAAAA AAATCGCCAT TATTTTTGGA GGCAATTCAC CGGAATACAC 50 

CGTTTCTTTA GCTTCAGCAA CTAGCGCAAT CGAAGCACTC CAATCATCTC 100 

CCTATGACTA CGACCTCTCT TTGATCGGGA TCGCCCCAGA TGCTATGGAT 150 

TGGTACTTGT ATACAGGAGA ACTGGAAAAC ATCCGACAAG ACACGTGGTT 200 

GTTGGATACG AAACATAAAC AGT^TU^TACA GCCGCTATTT GAAGGAAACG 2 50 

GCTTTTGGCT AAGTGAAGAG CAGCAAACGT TAGTACCTGA TATTTTATTT 3 00 

CCCATTATGC ATGGCAAATA CGGGGAAGAT GGCAGTATCC AAGGATTGTT 3 50 

TGAATTGATG AAACTACCTT ATGTAGGTTG CGGGGTGGCA GGTTCTGCCT 400 

TATGTATGAA CAAATGGCTG CTGCATCAAG CTGCAGCAGC CATTGGCGTA 450 

CAAAGTGCTC CTACGATTCT CTTGACAAAT CAAGCCAACC AGCAAGAACA 500 

AATCGAAGCT TTTATCCAGA CCCATGGCTT CCCAGTTTTC TTTAAGCCTA 550 

ATGAAGCGGG CTCTTCAAAA GGGATCACTA AAGTCACCTG CGTTGAAGAA 600 

ATCGCTTCTG CCTTAAAAAA AGCCTTTACT TATTGTTCCG CAGTGCTCCT 650 

ACAAAAAAAT ATTGCCGGTG TTGAGATCGG TTGCGGTATT TTGGGCAACG 700 

ACTCTTTGAC TGTCGGTGCT TGTGACGCCA TTTCATTAGT AGACGGCTTT 750 

TTCGATTTTG AAGAAAAGTA CCAGCTGATC AGCGCCAAAA TCACCGTCCC 800 

TGCGCCATTG CCTGAAACGA TTGAT^CCAA GGTCAAAGAA CAAGCTCAGC 850 

TGCTCTATCG TAGTCTTGGT CTTAAAGGTC TTGCTCGCAT CGACTTTTTT 900 

GTCACGGATC AAGGAGAACT ATACTTGAAT GAAATCAATA CTATGCCGGG 950 

CTTTACGAGT CACTCCCGTT ATCCTGCCAT GATGGCAGCG GTCGGCTTAT 1000 

CCTATCAAGA ACTACTACAA AAACTGCTTG 1030 



2) INFORMATION FOR SEQ ID NO: 1064 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EntGrococcus flavescens 

(B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064 

AAACATGA/y^ AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 2 00 

TGTTGGATAC GAAACAGAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 2 50 

GGCTTTTGGT TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 3 00 

TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 3 50 

TTGAATTGAT GAAGCTACCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 4 00 

TTATGTATGA ACAAATGGTT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 4 50 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCACGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TTCCAGTTTT CTTTAAGCCT 550 

AATGAAGCGG GTTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

AATCGCTTCT GCCTTAAAAG AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 650 

TACAAAAAAA TATTGCCGGT GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 700 

GACTCTTTGA CTGTCGGTGC TTGTGACGCC ATTTCATTAG TAGACGGCTT 750 

TTTCGATTTT GAAGAAAAGT ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 800 

CTGCGCCATT GCCTGAAACG ATTGAAACTA AGGTCAAAGA ACAAGCTCAG 850 

CTGCTCTATC GTAGTCTTGG ACTTAAAGGT CTTGCTCGCA TCGACTTTTT 900 

TGTCACGGAT CAAGGAGAAC TATACTTGAA TGAAATCAAT ACTATGCCGG 950 

GCTTTACGAG TCACTCCCGC TATCCTGCCA TGATGGCAGC GGTCGGGTTA 10 00 

TCCTATCAAG AACTACTACA AAAACTACTT GT 1032 



2) INFORMATION FOR SEQ ID NO: 1065 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: JSnterococcus flavescens 

(B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1065 

AAAAACATGA AAAAAATCGC CATTATTTTT GGAGGCAATT CACCGGAATA 50 

CACCGTTTCT TTAGCTTCAG CAACTAGCGC AATCGAAGCA CTCCAATCAT 100 

CTCCCTATGA CTACGACCTC TCTTTGATCG GGATCGCCCC AGATGCTATG 150 

GATTGGTACT TGTATACAGG AGAACTGGAA AACATCCGAC AAGACACGTG 2 00 

GTTGTTGGAT ACGAAACATA AACAGAAAAT ACAGCCGCTA TTCGAAGGAA 250 

ACGGCTTTTG GCTAAGTGAA GAGCAGCAAA CGTTGGTACC TGATGTTTTA 300 

TTTGCCATTA TGCATGGCAA ATACGGGGAA GATGGCAGTA TCCAAGGATT 350 

GTTTGAATTG ATGAAGCTGC CTTATGTAGG CTGCGGGGTG GCAAGTTCTG 400 

CCTTATGTAT GAACAAATGG CTGCTGCATC AAGCTGCAGC AGCCATTGGC 4 50 

GTACAAAGTG CTCCTACGAT TCTCTTGACA AATCAAGCCA ACCAGCAAGA 500 

ACAAATCGAA GCTTTTATCC AGACCCATGG CTTTCCAGTT TTCTTTAAGC 550 

CTAATGAAGC GGGCTCCTCA AAAGGGATCA CTAAAGTCAC CTGCGTTGAA 600 

604 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

GAAATCGCTT CTGCCTTAAA 
CCTACAi\AAA AATATTGCCG 
ACGACTCTTT GACTGTCGGT 
TTTTTCGATT TTGAAGAAAA 
5 CCCTGCGCCA TTGCCTGAAA 
AGCTGCTCTA TCGTAGTCTT 
TTTGTCACGG ATCAAGGAGA 
GGGCTTTACG AGTCACTCCC 
TATCCTATCA AGAACTACTA 



AGAAGCCTTT ACTTATTGTT 
GTGTTGAGAT CGGTTGCGGT 
GCTTGTGACG CCATTTCATT 
GTACCAGCTG ATCAGCGCCA 
CGATTGAAAC CAAGGTCAAA 
GGTCTTAAAG GTCTTGCTCG 
ACTATACTTG AATGAAATCA 
GCTATCCTGC CATGATGGCA 
CAAAAACTGC TTGT 



PCT/CAOO/01150 

CCGCAGTGCT 650 

ATTTTGGGCA 700 

AGTAGACGGC 750 

AAATCACCGT 800 

GAACAAGCTC 850 

CATCGACTTT 900 

ATACTATGCC 950 

GCGGTCGGCT 100 0 
1034 



2) INFORMATION FOR SEQ ID NO: 1066 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
25 (B) STRAIN: R760 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066 

CATGAAAAAA ATCGCCATTA TTTTTGGAGG CAATTCACCG GAATACACCG 5 0 

30 TTTCTTTAGC CTCAGCAACT AGCGCAATCG AAGCACTCCA ATCATCTCCC 10 0 

TATGATTACG ACCTCTCTTT GATCGGGATC GCCCCAGATG CTATGGATTG 150 

GTACTTGTAT ACAGGAGAAC TGGAAAACAT CCGACAAGAC ACGTGGTTGT 200 

TGGATACGAA ACATACACAG T^AAATCCAGC CACTTTTTGA AGGAAACGGC 250 

TTTTGGATAA GTGAAGCGCA GCAAACGTTG GTACCTGATG TTTTATTTCC 3 00 

35 CATTATGCAT GGTAAATACG GGGAAGATGG CAGTATCCAA GGATTGTTTG 350 

AATTGATGAA GCTGCCTTAT GTAGGCTGTG GGGTGGCAGC CTCTGCCTTA 4 00 

TGTATGAACA AATGGTTATT GCATCAAGCA GCAGCAGCGA TTGGCGTACA 450 

AAGCGCTCCT ACGATTCTCT TGACAAATCA AGCCAACCAG CAAAGACAAA 500 

TCGAAGCCTT TATCCAGACC CATGGCTTTC CAGTTTTCTT TAAGCCTAAT 550 

40 GAAGCGGGCT CCTCAAAAGG GATCAC7VAAA GTAACTTGTG TTGAAGAAAT 600 

CGCTCCTGCC TTGAAGGAAG CCTTCGCTTA TTGTTCCGCA GTGCTCTTAC 65 0 

AAAAAAATAT CGCTGGCGTT GAGATTGGTT GCGGTATCTT AGGCAACGAC 70 0 

TCTTTGACTG TCGGTGCTTG TGACGCTATT TCATTAGTAG ACGGCTTTTT 750 

CGATTTTGAA GAAAAGTACC AGTTGATCAG CGCCAAAATC ACCGTTCCTG 800 

45 CGCCATTGCC TGAAACGATT GAAACCAAAG TCAAAGAACA AGCTCAGCTG 850 

CTCTATCACA GTCTTGGTCT TAAAGGACTT GCTCGCATCG ACTTTTTTGT 900 

CACGGATCAA GGAGAACTAT ACTTGAATGA AATCAATACT ATGCCGGGCT 950 

TTACGAGTCA CTCCCGCTAT CCTGCCATGA TGGCAGCGGT CGGCTTATCC 1000 

TATCAAGAAT TA 1012 

50 



2) INFORMATION FOR SEQ ID NO: 1067 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



605 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faeciunt 

(B) STRAIN: R481 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 1067 



CTTACGCTTT 
TTGATTTTAT 
AATGAAGCGC 
GTTTGAAGCA 
CATACCCCAA 
TGCACGGACA 
TGTAACTGGT 
AAGGCAGAGC 
CCTGATAATA 
TGTCAACAGG 
AAAATTCATC 
TGACTGTAAT 
GTTTAAGGGG 
AGGAGGAGAC 
CTTAATATAC 



ATCGATTAGA 
GGATGAACGC 
AAAATCGCAG 
TATAGCCTCG 
TAGCTATTTT 
AACTATATAA 
TCTTAGGGAA 
GATATTGCGG 
AGACTGATCG 
CAGTTCAGCC 
TACATTGGTG 
TTACGGGGCA 
AAGATTCTAG 
TGAAAATGAA 
TTAGGTTATT 



CACGGGNAGC 
TCTCATCATG 
ACGTTTGCGC 
AATGGTGGCA 
GATTTCCCCG 
GCTAACTCTT 
TTTATATATA 
TCATTATCTG 
CATAGAGGGG 
TCGTTT^AATT 
ATAATAGTAA 
AAACGGCACA 
AAATATTTCA 
GAAGTTGTTT 
G 



TTGTCCAATG GGRAGCCGAT 
CGGCAAATGG AATATCATGC 
TCCATCATGG AAAACAGTGG 
CTATGTATTA AGAGACGAAC 
TTAAATAAAC TTTTAACCGT 
TCGGCAGGAA ACCCGACGTA 
GTAGATAGTA TTGAAGATGT 
CGTGCGCTGC GGCAAGATAG 
TGGTATTTCA CACCGCCCAT 
CAGCATGGGT ATCACTTATG 
ATCCAGTAGG GCGAAATAAT 
ATCTCAAACG AGATTGTGCC 
TACTTCCAAC TATATAGTTA 
TTTTTATTGT TATTGTTATT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
721 



2) INFORMATION FOR SEQ ID NO: 1068 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 

ATTTTTAAGG ATGAACGCTC TTCATCATGC GGCAAATGGA ATATCATGCA 50 

ATGAAGCGCA AAATCGCAGA CGTTTGCGCT CCATCATGGA AAACAGTGGG 100 

TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC TATGTATTAA GAGACGAACC 150 

ATACCCCAAT AGCTATTTTG ATTTCCCCGT TAAATAAACT TTTAACCGTT 2 00 

GCACGGACAA ACTATATAAG CTAACTCTTT CGGCAGGAAA CCCGACGTAT 250 

GTAACTGGTT CTTAGGGAAT TTATATATAG TAGATAGTAT TGAAGATGTA 300 

AGGCAGAGCG ATATTGCGGT CATTATCTGC GTGCGCTGCG GCAAGATAGC 350 

CTGATAATAA GACTGATCGC ATAGAGGGGT GGTATTTCAC ACCGCCCATT 4 00 

GTCAACAGGC AGTTCAGCCT CGTTAAATTC AGCATGGGTA TCACTTATGA 4 50 

AAATTCATCT ACATTGGTGA TAATAGTAAA TCCAGTAGGG CGAAATAATT 500 

GACTGTAATT TACGGGGCAA AACGGCACAA TCTCAT^CGA GATTGTGCCG 550 

TTTAAGGGGA AGATTCTAGA AATATTTCAT ACTTCCAACT ATATAGTTAA 600 

GGAGGAGACT GAAAATGAAG AAGTTGTTTT TTTTATTGTT ATTGTTATTC 650 

TTAATATACT TAGGTTAT 668 



2 ) INFORMATION FOR SEQ ID NO: 1069 



606 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGT^ISM: Enterococcus faecium 

(B) STRAIN: R581 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069 



CGGCAAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTRAGCTT 50 

GTACCAATGG GAAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 10 0 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 150 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 200 

20 TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 250 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 300 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 3 50 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 400 

GTGCGCTGCG GCAAGATAGC CTGATAATT^ GACTGATCGC ATAGAGGGGT 450 

25 GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 500 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 550 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 600 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 650 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 70 0 

30 TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 750 

GAAANCCTGA 760 



35 2 ) INFORMATION FOR SEQ ID NO: 1070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 01 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070 



AAAAGGAATA CGGGGCTTTC AAAAATCCAA GCCATAACCC GCGGGCAAGT 50 

GCCATTTGAT TCTTACGCTT TAATCGATTA GACACGGGTA AGCTTGTACC 100 

AATGGGGAAC CGATTTGATT TTAATGNATG AACGCTCTTC ATCATGCGGC 150 

AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 2 00 

55 TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGGCACTAT 250 

GTATTAAGAG ACGAACCATA CCCCAATAGC TATTTTGATT TCCCCGTTAA 300 

ATA/^CTTTT AACCGTTGCA CGGACAAACT ATATAAGCTA ACTCTTTCGG 350 

CAGGAAACCC GACGTATGTA ACTGGTTCTT AGGGAATTTA TATATAGTAG 400 

ATAGTATTGA AGATGTT^GG CAGAGCGATA TTGCGGTCAT TATCTGCGTG 450 

60 CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 500 

607 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

ATTTCACACC GCCCATTGTC 
ATGGGTATCA CTTATGAAAA 
AGTAGGGCGA AATAATTGAC 
CAAACGAGAT TGTGCCGTTT 
5 TCCAACTATA TAGTTAAGGA 
TTATTGTTAT TGTTATTCTT 
A 



AACAGGCAGT TCAGCCTCGT 
TTCATCTACA TTGGTGATAA 
TGTAATTTAC GGGGCAAAAC 
AAGGGGAAGA TTCTAGAAAT 
GGAGACTGAA AATGAAGAAG 
AATATACTTA GGTTATGACT 



PCT/CAOO/01150 

TAAATTCAGC 550 

TAGTAAATCC 600 

GGCACAATCT 650 

ATTTCATACT 700 

TTGTTTTTTT 750 

ACGTTAATGA 80 0 

801 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1071 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 711 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entejrococcus gallinarum 

(B) STRAIN: R684 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071 

TTGTACCAAT GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT 50 

GCGGCAAATG GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG 100 

CTCCATCATG GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC 150 

30 ACTATGTATT AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC 2 00 

GTTAAATAAA CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT 250 

TTCGGCAGGA AACCCGACGT ATGTAACTGG TTCTTAGGGA ATTTATATAT 300 

AGTAGATAGT ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT 350 

GCGTGCGCTG CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG 400 

35 GTGGTATTTC ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT 450 

TCAGCATGGG TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA 500 

AATCCAGTAG GGCGAAATAA TTGACTGTAA TTTACGGGGC AAT^CGGCAC 550 

AATCTCAAAC GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC 600 

ATACTTCCAA CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT 650 

4 0 TTTTTTATTG TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA 700 

ATGAAGCACT G "^^l 



45 2) INFORMATION FOR SEQ ID NO: 1072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072 

608 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

GCCATTGATC TTACGCTTTA TCGATTAGAC 
GGGGAGCCGA TTTGATTTTA TGGATGAACG 
GAATATCATG CAATGAAGCG CAAAATCGCA 
GAAAACAGTG GGTTTGAAGC ATATAGCCTC 
5 AAGAGACGAA CCATACCCCA ATAGCTATTT 
CTTTTAACCG TTGCACGGAC AAACTATATA 
AACCCGACGT ATGTAACTGG TTCTTAGGGA 
ATTGAAGATG TAAGGCAGAG CGATATTGCG 
CGGCAAGATA GCCTGATAAT AAGACTGATC 

10 ACACCGCCCA TTGTCAACAG GCAGTTCAGC 
TATCACTTAT GAAAATTCAT CTACATTGGT 
GGCGAAATAA TTGACTGTAA TTTACGGGGC 
GAGATTGTGC CGTTTAAGGG GAAGATTCTA 
CTATATAGTT AAGGAGGAGA CTGAAAATGA 

15 TTATTGTTAT TCTTAATATA CTTAGGTTAT 
G 



PCT/CAOO/01150 

ACGGGTNAGC TTGTACCAAT 50 

CTCTCATCAT GCGGCAAATG 100 

GACGTTTGCG CTCCATCATG 150 

GAATGGTGGC ACTATGTATT 2 00 

TGATTTCCCC GTTAAATAAA 250 

AGCTAACTCT TTCGGCAGGA 300 

ATTTATATAT AGTAGATAGT 3 50 

GTCATTATCT GCGTGCGCTG 4 00 

GCATAGAGGG GTGGTATTTC 4 50 

CTCGTTAAAT TCAGCATGGG 500 

GATAATAGTA AATCCAGTAG 550 

AAAACGGCAC AATCTCAAAC 600 

GAAATATTTC ATACTTCCAA 650 

AGAAGTTGTT TTTTTTATTG 700 

GACTACGTTA ATGAAGCACT 750 

751 



20 2) INFORMATION FOR SEQ ID NO: 1073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1073 



ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT 50 

GCAATGAAGC GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT 100 

GGGTTTGAAG CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA 150 

ACCATACCCC AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC 200 

40 GTTGCACGGA CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG 250 

TATGTAACTG GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT 3 00 

GTAAGGCAGA GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT 3 50 

AGCCTGATAA TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC 4 00 

ATTGTCAACA GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA 450 

45 TGAAAATTCA TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA 500 

ATTGACTGTA ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG 550 

CCGTTTAAGG GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT 600 

TAAGGAGGAG ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA 650 

TTCTTAATAT ACTTAGGTTA TGACTACGTT AATGA 685 

50 



2) INFORMATION FOR SEQ ID NO: 1074 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRAIODEDNESS : Double 

(D) TOPOLOGY: Linear 



609 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R690 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 1074 



ATCGATTAGA 
ATGGATGAAC 
GCAAT^TCGC 
CATATAGCCT 
AATAGCTATT 
CAAACTATAT 
GTTCTTAGGG 
GCGATATTGC 
TAAGACTGAT 
GGCAGTTCAG 
TCTACATTGG 
ATTTACGGGG 
GGAAGATTCT 
ACTGAAAATG 
ACTTAGGTTA 



CACGGGTGAG 
GCTCTCATCA 
AGACGTTTGC 
CGAATGGTGG 
TTGATTTCCC 
AAGCTAACTC 
AATTTATATA 
GGTCATTATC 
CGCATAGAGG 
CCTCGTTAAA 
TGATAATAGT 
CAAAACGGCA 
AGAAATATTT 
AAGAAGTTGT 
TGACTACGTT 



CTTGTACCAA 
TGCGGCAAAT 
GCTCCATCAT 
CACTATGTAT 
CGTTAAATAA 
TTTCGGCAGG 
TAGTAGATAG 
TGCGTGCGCT 
GGTGGTATTT 
TTCAGCATGG 
AAATCCAGTA 
CAATCTCAAA 
CATACTTCCA 
TTTTTTTATT 
AATGAAGCAC 



TGGGGAGCCG ATTTGATTTT 
GGAATATCAT GCAATGAAGC 
GGAAAACAGT GGGTTTGAAG 
TAAGAGACGA ACCATACCCC 
ACTTTTAACC GTTGCACGGA 
AAACCCGACG TATGTAACTG 
TATTGAAGAT GTAAGGCAGA 
GCGGCAAGAT AGCCTGATAA 
CACACCGCCC ATTGTCAACA 
GTATCACTTA TGAAAATTCA 
GGGCGAAATA ATTGACTGTA 
CGAGATTGTG CCGTTTAAGG 
ACTATATAGT TAAGGAGGAG 
GTTATTGTTA TTCTTAATAT 
TG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
732 



2) INFORMATION FOR SEQ ID NO: 1075 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinajrum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075 

TCTCATCATG CGGCAAATGG AATATCATGC AATGAAGCGC AAAATCGCAG 50 

ACGTTTGCGC TCCATCATGG AAAACAGTGG GTTTGAAGCA TATAGCCTCG 100 

AATGGTGGCA CTATGTATTA AGAGACGAAC CATACCCCAA TAGCTATTTT 150 

GATTTCCCCG TTAAATAAAC TTTTAACCGT TGCACGGACA AACTATATAA 2 00 

GCTAACTCTT TCGGCAGGAA ACCCGACGTA TGTAACTGGT TCTTAGGGAA 250 

TTTATATATA GTAGATAGTA TTGAAGATGT AAGGCAGAGC GATATTGCGG 300 

TCATTATCTG CGTGCGCTGC GGCAAGATAG CCTGATAATA AGACTGATCG 350 

CATAGAGGGG TGGTATTTCA CACCGCCCAT TGTCAACAGG CAGTTCAGCC 400 

TCGTTAAATT CAGCATGGGT ATCACTTATG AAAATTCATC TACATTGGTG 450 

ATAATAGTAA ATCCAGTAGG GCGAAATAAT TGACTGTAAT TTACGGGGCA 500 

AAACGGCACA ATCTCAAACG AGATTGTGCC GTTTAAGGGG AAGATTCTAG 550 

AAATATTTCA TACTTCCAAC TATATAGTTA AGGAGGAGAC TGAAAATGAA 600 

GAAGTTGTTT TTTTTATTGT TATTGTTATT CTTAATATAC TTAGGTTATG 650 

ACTACGTTAA TGAAGCACTG 670 



2) INFORMATION FOR SEQ ID NO: 1076 



610 
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wo 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: DG131/3 serotype 0X3 :H8 

(C) ACCESSION NUMBER: Z36901 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1076 

ATGAAAATAA TAATTTTTAG AGTGCTAACT TTTTTCTTTG TTATCTTTTC 50 

TGTTAATGTG GTTGCGAAGG AATTTACCTT AGATTTCTCG ACAGCAAAGA 100 

CGTATGTAGA TTCGCTGAAT GTCATTCGCT CTGCAATAGG TACTCCATTA 150 

20 CAGACTATTT CATCAGGAGG TACGTCTTTA CTGATGATTG ATAGTGGCAC 2 00 

AGGGGATAAT TTGTTTGCAG TTGATGTCAG AGGGATAGAT CCAGAGGAAG 250 

GGCGGTTTAA TAATCTACGG CTTATTGTTG AACGAAATAA TTTATATGTG 300 

ACAGGATTTG TTAACAGGAC AAATAATGTT TTTTATCGCT TTGCTGATTT 3 50 

TTCACATGTT ACCTTTCCTG GTACAACTGC GGTTACATTG TCTGGTGACA 4 00 

25 GTAGCTATAC CACGTTACAG CGTGTTGCGG GGATCAGTCG TACGGGGATG 450 

CAGATAAATC GCCATTCGTT GACTACTTCT TATCTGGATT TAATGTCGCA 500 

TAGCGGAACC TCACTGACGC AGTCTGTGGC AAGAGCGATG TTACGGTTTG 550 

TTACTGTGAC AGCTGAAGCT TTACGTTTTC GGCAAATTCA GAGGGGATTT 600 

CGTACAACAC TTGATGATCT CAGTGGGCGT TCTTATGTAA TGACTGCTGA 650 

30 AGATGTTGAT CTTACGTTGA ACTGGGGAAG GTTGAGTAGT GTCCTGCCTG 700 

ACTATCATGG ACAAGACTCT GTTCGTGTTG G7AGAATTTC TTTTGGAAGT 750 

GTTAATGCAA TTCTGGGTAG CGTGGCATTA ATACTGAATT GTCATCATGA 800 

TGCATCGCGA GTTGCCAGAA TTGTACCTAA TGAGTTTCCT TCTATGTGCC 850 

CGGTAGATGG AAGAGTGCGT GGGATTACGC ACAATAAAAT ATTGTGGGAC 900 

35 TCATCCACTC TGGGGGCAAT TTTGATACGC AGGGCTATTA GCAGTTGA 948 



40 



55 



2) INFORMATION FOR SEQ ID NO: 1077 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Escherichia coli 

(B) STRAIN: 94C serotype 048:H21 

(C) ACCESSION NUMBER: extracted from Z37725 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077 



CACCTGTATA TGAAGTGTAT ATTATTTAAA TGGGTACTGT GCCTGTTACT 50 

GGGTTTTTCT TCGGTATCCT ATTCCCGGGA GTTTACGATA GACTTTTCGA 100 

CCCAACAAAG TTATGTCTCT TCGTTAAATA GTATACGGAC AGAGATATCG 150 

ACCCCTCTTG AACATATATC TCAGGGGACC ACATCGGTGT CTGTTATTAA 200 

60 CCACACCCCA CCGGGCAGTT ATTTTGCTGT GGATATACGA GGGCTTGATG 250 
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TCTATCAGGC GCGTTTTGAC 
TTATATGTGG CCGGGTTCGT 
TTCAGATTTT ACACATATAT 
CAACGGACAG CAGTTATACC 
5 TCCGGAATGC AAATCAGTCG 
AATGGAGTTC AGTGGTAATA 
TGCGTTTTGT CACTGTCACA 
AGAGAATTTC GTCAGGCACT 
GCCGGGAGAC GTGGACCTCA 

10 TTCCGGAGTA TCGGGGAGAG 
AATAATATAT CAGCGATACT 
TCATCAGGGG GCGCGTTCTG 
AATGTCAGAT AACTGGCGAC 
TGGGAAAGTA ATACAGCTGC 

15 ATATACAACG GGTAAATAAA 
GGTTTTATTT GCATTAGTTT 
AAGGTAAAAT TGAGTTTTCC 
AAGGTTGACG GGAAAGAATA 
ACTGCAAAGT GCTCAGCTGA 

2 0 CCTGTGAATC AGGCTCCGGA 
GGCATAACC 



CATCTTCGGC TGATTATTGA 
TAATACGGCA ACAAATACTT 
CAGTGCCCGA TGTGACAACG 
ACTCTGCAAC GTGTCGCAGC 
TCACTCACTG GTTTCATCAT 
CT^TGACCAG AGATGCATCC 
GCAGAAGCCT TACGCTTCAG 
GTCTGAAACT GCTCCTGTGT 
CTCTGAACTG GGGGCGAATC 
GATGGTGTCA GAGTGGGGAG 
GGGTACTGTG GCCGTTATAC 
TTCGCGCCGT GAATGAAGAG 
AGGCCCGTTA TAAAAATAAA 
AGCGTTTCTG AACAGAAAGT 
GGAGTTAAGT ATGAAGAAGA 
CTGTTAATGC AATGGCGGCG 
AAGTATAATG AGGATGACAC 
CTGGACCAGT CGCTGGAATC 
CAGGAATGAC TGTCACAATC 
TTTGCTGAAG TGCAGTTTAA 



PCT/CAOO/01150 



GCAAAATAAT 


300 


TCTACCGTTT 


350 


GTTTCCATGA 


400 


GCTGGAACGT 


450 


ATCTGGCGTT 


500 


AGAGCAGTTC 


550 


GCAGATACAG 


600 


ATACCATGAC 


650 


AGCAATGTGC 


700 


AATATCCTTT 


750 


TGTU^TTGCCA 


800 


AGTCAACCAG 


850 


CAATACATTA 


900 


CACAGTTTTT 


950 


TGTTTATGGC 


1000 


GATTGTGCTA 


1050 


ATTTACAGTG 


1100 


TGCAACCGTT 


1150 


AAATCCAGTA 


1200 


TAATGACTGA 


1250 




1259 



25 2) INFORMATION FOR SEQ ID NO: 1078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078 

AGTTCTGCGT TTTGTCACTG TC 22 
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2) INFORMATION FOR SEQ ID NO: 1079 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079 
CGGAAGCACA TTGCTGATT 19 

2 ) INFORMATION FOR SEQ ID NO: 1080 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 25 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080 
TATAGCTACT GTCACCAGAC AATGT 



2 ) INFORMATION FOR SEQ ID NO: 1081 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081 
ATGTCAGAGG GATAGATCCA 



2) INFORMATION FOR SEQ ID NO: 1082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082 
TTGARCRAAA TAATTTATAT GTG 



2 ) INFORMATION FOR SEQ ID NO: 1083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083 
TGATGATGRC AATTCAGTAT 
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2) INFORMATION FOR SEQ ID NO: 1084 

(i) SEQUENCE CHTU^CTERISTICS : 

(A) LENGTH: 3 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084 
CCACGCCGCT TTGCTGATTT TTCACATGTT ACCGCGTGG 3 9 



2) INFORMATION FOR SEQ ID NO: 1085 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085 
CCACGCCACT GTCTGAAACT GCTCCTGTGC GTGG 34 

30 

2) INFORMATION FOR SEQ ID NO: 1086 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40. 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086 
45 CTACTCCCGC CTTTTGGGTT 20 



2) INFORMATION FOR SEQ ID NO: 1087 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087 

60 
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CTCACAGCCC GAAACAGCCT 
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20 



5 2) INFORMATION FOR SEQ ID NO: 1088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088 

TGCCGTTTCC TGTATCCGTC 20 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1089 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089 
ATCCACACGG GCTAGACCTC 20 

2) INFORMATION FOR SEQ ID NO: 1090 



(i) SEQUENCE CHTU^CTERISTICS : 
40 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090 
AATAGCGCGG ACGAATTGGA C 21 

50 



2) INFORMATION FOR SEQ ID NO: 1091 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 91 
AACGCGGCAC TGTTTCCCAA 



2) INFORMATION FOR SEQ ID NO: 1092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092 
TCGGCAAGAC AATATGACAG C 



2) INFORMATION FOR SEQ ID NO: 1093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa-165 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 

TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 50 

AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 100 

ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 150 

CGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACCTGCTT 200 

TTTTAA^^GTA TAGGTATAAA ATACAATTGA TTAAAATAGT AAAGGAAATG 250 

AATCATGAAA CAATTAACTA AGCCTTTATA CTTTTACCTA TTACTTTTTA 3 00 

TTACAACAAC ACTGATTGGC GCGTTACTAT TATATTTGCC AATCACAGGT 350 

AAACATCCTA TTGATTTTGT GGACGCCCGT T 381 



2) INFORMATION FOR SEQ ID NO: 1094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094 
GGTAAAACAG GTACTTCTAA CTA 
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23 



2) INFORMATION FOR SEQ ID NO: 1095 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095 
20 CGATAGAAGC AGCAGGACAA 20 



2) INFORMATION FOR SEQ ID NO: 1096 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096 

CTGATGGATG CGGAAGATAC 2 0 
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2) INFORMATION FOR SEQ ID NO: 1097 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097 
TCYTCAAAAG GGATCACWAA AGTMAC 26 

2) INFORMATION FOR SEQ ID NO: 1098 



(i) SEQUENCE CHAJ^CTERISTICS: 
60 (A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098 
TCTTCAAAAT CGAAAAAGCC GTC 23 

2) INFORMATION FOR SEQ ID NO: 1099 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099 
TCAAAAGGGA TCACWAAAGT MAC 23 

25 

2) INFORMATION FOR SEQ ID NO: 1100 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100 
40 GTAT^KCCCG GCATRGTRTT GATTTC 26 



2) INFORMATION FOR SEQ ID NO: 1101 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101 

55 

GACGGYTTTT TYGATTTTGA AGA 23 



60 2) INFORMATION FOR SEQ ID NO: 1102 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102 
AAAAARTCGA TKCGAGCMAG ACC 



2) INFORMATION FOR SEQ ID NO : 1103 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103 
ATCCCGCTAT GAAAACGATC 



2) INFORMATION FOR SEQ ID NO: 1104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104 
GGATCAACAC AGTAGAACCG 



2) INFORMATION FOR SEQ ID NO: 1105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105 
CTCCTACGAT TCTCTTGAYA AATCA 
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2) INFORMATION FOR SEQ ID NO: 1106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106 
CAACCGATCT CAACACCGGC AAT 



2) INFORMATION FOR SEQ ID NO: 1107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107 
CTCATTTGAC TTCCTCCTTT GCT 



2) INFORMATION FOR SEQ ID NO: 1108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108 
GTAAGAATCG GAAAAGCGGA AGG 



2) INFORMATION FOR SEQ ID NO : 1109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109 
ACATCGTGAT CGCTAAAAGG AGC 
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2) INFORMATION FOR SEQ ID NO: 1110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110 
ACGAGAAAGA CAACAGGAAG ACC 



2) INFORMATION FOR SEQ ID NO: 1111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1111 
CTTTTTCCGG CTCGWYTTCC TGATG 



2) INFORMATION FOR SEQ ID NO: 1112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1112 
GGCTGYGATA TTCAAAGCTC 



2) INFORMATION FOR SEQ ID NO: 1113 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113 
ACCGACCTCA CAGCCCGAAA 20 



2 ) INFORMATION FOR SEQ ID NO : 1114 



(i) SEQUENCE CHARACTERISTICS: • 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114 
TCWGAGCCTT TTTCCGGCTC G 21 

25 

2 ) INFORMATION FOR SEQ ID NO: 1115 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115 
40 TTTCGGGCTG TGAGGTCGGB TGHGCG 26 



2) INFORMATION FOR SEQ ID NO: 1116 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116 

55 

TTTCGGGCTG TGAGGTCGGB TGHGCGG 27 



60 2) INFORMATION FOR SEQ ID NO: 1117 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: 94 

(C) ACCESSION NUMBER: U94526 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117 

AAATTCGATC CGCACTACAT CGGAATTACA AAAAACGGTG TATGGAAGCT 50 

ATGCAAGAAG CCATGTACGG AATGGGAAGC CGACAGTCTC CCCGCCATAC 100 

TCTCCCCGGA TAGGAAAACG CATGGGCTGC TTGTCATGAA AGAAAGCGAA 150 

TACGAAACAC GGCGTATTGA TGTGGCTTTC CCGGTTTTGC ATGGCAAATG 2 00 

CGGGGAGGAT GGTGCGATAC AGGGGCTGTT TGTATTGTCT GGTATCCCCT 250 

ATGTGGGCTG TGATATTCAA AGCTCCGCAG CTTGCATGGA CAAATCACTG 3 00 

GCCTACATTC TTACAAAAAA TGCGGGCATC GCCGTTCCCG AATTTCAAAT 3 50 

GATTGATAAA GGTGACAAGC CGGAGGCGGG TGCGCTTACC TACCCTGTCT 4 00 

TTGTGAAGCC GGCACGGTCA GGTTCGTCCT TTGGCGTAAC CAAAGTAAAC 450 

GGTACGGAAG AACTTAACGC TGCGATAGAA GCGGCAGGAC AATATGATGG 500 

AAAAATCTTA ATTGAGCAAG CGATTTCGGG CTGTGAGGTC GGGTGTGCGG 550 

TCATGGGGAA CGAGGATGAT TTGATTGTCG GCGAAGTGGA TCAAATCCGG 600 

CTGAGCCACG GTATCTTCCG CATCCATCAG GAAAACGAGC CGGAAAAAGG 650 

CTCAGAAAAT GCGATGATTA CAGTTCCCGC AGACATTCCG GTCGAGGAAC 700 

GAAATCGGGT GCAGGAAACG GCAAAGAAAG TATATCGGGT GCTTGGATGC 750 

AGAGGGCTTG CCCGTGTTGA TCTTTTTTTG CAGGAGGATG GCGGCATCGT 800 

T 801 



2) INFORMATION FOR SEQ ID NO: 1118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118 
TTTTCWGAGC CTTTTTCCGG CTCG 



2) INFORMATION FOR SEQ ID NO: 1119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119 
TTTCGGGCTG TGAGGTCGGB TGHGC 



2) INFORMATION FOR SEQ ID NO: 1120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120 
TTTCGGGCTG TGAGGTCGGB TGHG 



2) INFORMATION FOR SEQ ID NO: 1121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121 
TGTTTGWATT GTCYGGYATC CC 



2) INFORMATION FOR SEQ ID NO: 1122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122 
TGGTGCATTG CTACGTGG 



2 ) INFORMATION FOR SEQ ID NO: 1123 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123 
TTTCGGGCTG TGAGGTCGGB TG 



2) INFORMATION FOR SEQ ID NO: 1124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124 
GATTTGRTCC ACYTCGCCRA CA 



2) INFORMATION FOR SEQ ID NO: 1125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1125 
ACTCACAACT GGGATGGATG 



2 ) INFORMATION FOR SEQ ID NO: 112 6 

(i) SEQUENCE CH7VRACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1126 
TTATGGTTGT GCTGGTTGAG G 
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2) INFORMATION FOR SEQ ID NO: 1127 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127 
KCAAAYGCCA TTTCAAGTAA 



2) INFORMATION FOR SEQ ID NO: 1128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128 
GACGACYTTA TKGATATACA 



2) INFORMATION FOR SEQ ID NO: 112 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129 
ATGATGACHG T^MATGATGAA AAC 



2) INFORMATION FOR SEQ ID NO: 1130 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130 
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GACTATCCAA GCATGCATTA TG 22 



5 2 ) INFORMATION FOR SEQ ID NO: 1131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131 

CATCTGGAGC TACRTARCCA GT 22 



20 



30 



35 



60 



2 ) INFORMATION FOR SEQ ID NO: 1132 



(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132 
AGTGT^AAARA TGGCTGCTGC 20 

2) INFORMATION FOR SEQ ID NO: 1133 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133 
CATCAAGAAC ACTGGCTAYG TAG 23 

50 

2) INFORMATION FOR SEQ ID NO: 1134 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134 
5 CTAGATAGAG CTAAAACCTT CCT 23 



2) INFORMATION FOR SEQ ID NO: 1135 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135 

20 

CATTATGCAA ACGCCATTTC AAG 23 



25 2) INFORMATION FOR SEQ ID NO: 1136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136 

ACTTGTCCAC GTTSGATRTC T 21 



40 



50 
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2 ) INFORMATION FOR SEQ ID NO: 1137 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137 
AATTAATGGC TGCWGTTGAY GAA 23 

2 ) INFORMATION FOR SEQ ID NO: 1138 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1032 bases 
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WO 01/23604 PCT/CAOO/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(C) ACCESSION NUMBER: M75132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1138 

ATGAAAAAAA TTGCCGTTTT ATTTGGAGGG AATTCTCCAG AATACTCAGT 50 

GTCACTAACC TCAGCAGCAA GTGTGATCCA AGCTATTGAC CCGCTGAAAT 100 

ATGAAGTAAT GACCATTGGC ATCGCACCAA CAATGGATTG GTATTGGTAT 150 

CAAGGAAACC TCGCGAATGT TCGCAATGAT ACTTGGCTAG AAGATCACAA 200 

AAACTGTCAC CAGCTGACTT TTTCTAGCCA AGGATTTATA TTAGGAGAAA 250 

AACGAATCGT CCCTGATGTC CTCTTTCCAG TCTTGCATGG GAAGTATGGC 300 

GAGGATGGCT GTATCCAAGG ACTGCTTGAA CTAATGAACC TGCCTTATGT 3 50 

TGGTTGCCAT GTCGCTGCCT CCGCATTATG TATGAACAAA TGGCTCTTGC 400 

ATCAACTTGC TGATACCATG GGAATCGCTA GTGCTCCCAC TTTGCTTTTA 4 50 

TCCCGCTATG AAAACGATCC TGCCACAATC GATCGTTTTA TTCAAQACCA 500 

TGGATTCCCG ATCTTTATCA AGCCGAATGA AGCCGGTTCT TCAAAAGGGA 550 

TCACAAAAGT AACTGACAAA ACAGCGCTCC AATCTGCATT AACGACTGCT 60 0 

TTTGCTTACG GTTCTACTGT GTTGATCCAA AAGGCGATAG CGGGTATTGA 650 

AATTGGCTGC GGCATCTTAG GAAATGAGCA ATTGACGATT GGTGCTTGTG 700 

ATGCGATTTC TCTTGTCGAC GGTTTTTTTG ATTTTGAAGA GAAATACCAA 750 

TTAATCAGCG CCACGATCAC TGTCCCAGCA CCATTGCCTC TCGCGCTTGA 800 

ATCACAGATC AAGGAGCAGG CACAGCTGCT TTATCGAAAC TTGGGATTGA 850 

CGGGTCTGGC TCGAATCGAT TTTTTCGTCA CCAATCAAGG AGCGATTTAT 900 

TTAAACGAAA TCAACACCAT GCCGGGATTT ACTGGGCACT CCCGCTACCC 950 

AGCTATGATG GCGGAAGTCG GGTTATCCTA CGAAATATTA GTAGAGCAAT 1000 

TGATTGCACT GGCAGAGGAG GACAAACGAT GA 1032 



2 ) INFORMATION FOR SEQ ID NO: 1139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1768 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(C) ACCESSION NUMBER: X56895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1139 



GATATCGTTA CGCTTCATGT 
CAGCCACGAA CAAATACAGA 
CTGGGCGCGG TCCACTTGTA 
AACGGGAAAC TGGGCGGTGC 
GTTTTTCTAC TCTGATTGCA 
TTAAACTTCA AAGAATGCCT 
TATACCGAGC AAGCGTTGCG 



GCCGCTCAAT ACGGATACGC 
GAATGAAGCA AGGAGCATTT 
GATACCTATG AGTTGGTTAA 
CGCATTGGAT GTATTGGAAG 
CCCAAAAACC AATTGATAAT 
AACGTGATAA TCACACCGCA 
TGATACCGTT GAAAAAACCA 

629 



ACTATATTAT 50 

CTTATCAATA 100 

AGCATTAGAA 150 

GAGAGGAAGA 2 00 

CAATTTTTAC 250 

TACGGCCTAT 300 

TTAAAAACTG 350 
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TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 400 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 450 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 500 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 550 

5 TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 60 0 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 650 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 700 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 750 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 800 

LO ATGCTGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 850 

CCGGTGGCAG CTACGTTTAC CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 900 

AGGCTCATCC TTCGGTGTGA AAAT^GTCAA TAGCGCGGAC GAATTGGACT 950 

ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 1000 

GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 1050 

15 GTTAGTTGTT GGCGAGGTGG ACCAAATCAG GCTGCAGTAC GGAATCTTTC 1100 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 1150 

ACCGTTCCCG CAGACCTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 12 00 

GGCAAAAAAA ATATATAT^G CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 12 50 

ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCJ\ATACT 13 00 

20 CTGCCCGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 13 50 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 14 00 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 1450 

ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 1500 

AAACCGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAGTT 1550 

25 GGCTGAATCG CTTTTGAAGG CAAAAGAACT GGCTGCTACC CAAGGGTACG 160 0 

GATTGCTTCT ATGGGACGGT TACCGTCCTA AGCGTGCTGT AAACTGTTTT 1650 

ATGC/^TGGG CTGCACAGCC GGAAAATAAC CTGACAAAGG AAAGTTATTA 1700 

TCCCAATATT GACCGAACTG AGATGATTTC AAAAGGATAC GTGGCTTCAA 1750 

AATCAAGCCA TAGCCGCG 1768 

30 

2) INFORMATION FOR SEQ ID NO: 1140 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus CcLSSGliflsLVUS 
45 (C) ACCESSION NUMBER: L29638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140 

GTAAGAATCG GAAAAGCGGA AGGAAGAAAA ACATGAAAAA AATCGCCATT 50 

50 ATTTTTGGAG GCAATTCACC GGAATACACC GTTTCTTTAG CTTCAGCAAC 100 

TAGCGCAATC GAAGCACTCC AATCATCTCC CTATGACTAC GACCTCTCTT 150 

TGATCGGGAT CGCCCCAGAT GCTATGGATT GGTACTTGTA TACAGGAGAA 200 

CTGGAAAACA TCCGACAAGA CACGTGGTTG TTGGATACGA AACATAAACA 250 

GAAAATACAG CCGCTATTCG AAGGAAACGG CTTTTGGCTA AGTGAAGAGC 3 00 

55 AGCAAACGTT GGTACCTGAT GTTTTATTTC CCATTATGCA TGGCAAATAC 350 

GGGGAAGATG GCAGTATCCA AGGATTGTTT GAATTGATGA AGCTGCCTTA 400 

TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT ATGTATGAAC AAATGGCTGC 4 50 

TGCATCAAGC TGCAGCAGCC ATTGGCGTAC AAAGTGCTCC TACGATTCTC 500 

TTGACAAATC AAGCCAACCA GCAAGAACAA ATCGAAGCTT TTATCCAGAC 550 

60 CCATGGCTTC CCAGTTTTCT TTAAGCCTAA TGAAGCGGGC TCCTCAAAAG 600 
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GGATCACTAA AGTCACCTGC 
GCCTTTACTT ATTGTTCCGC 
TGAGATCGGT TGCGGTATTT 
GTGACGCCAT TTCATTAGTA 
5 CAGCTGATCA GCGCCAAAAT 
TGAAACCAAG GTCT^AAGAAC 
TTAAAGGTCT TGCTCGCATC 
TACTTGAATG AAATCAATAC 
TCCTGCCATG ATGGCAGCGG 
LO AACTGCTTGT CTTAGCAAAG 



GTTGAAGAAA TCGCTTCTGC 
AGTGCTCCTA CAAAAAAATA 
TGGGCAACGA CTCTTTGACT 
GACGGCTTTT TCGATTTTGA 
CACCGTCCCT GCGCCATTGC 
AAGCTCAGCT GCTCTATCGT 
GACTTTTTTG TCACGGAGCG 
TATGCCGGGC TTTACGAGTC 
TCGGCTTATC CTATCAAGAA 
GAGGAAGTCA AATGAG 



PCT/CAOO/01150 

CTTAAAAGAA 650 

TTGCCGGTGT 700 

GTCGGTGCTT 750 

AGAAAAGTAC 800 

CTGAAACGAT 850 

AGTCTTGGTC 900 

AGGAGAACTA 950 

ACTCCCGCTA 1000 

CTACTACAAA 1050 
1086 



2) INFORMATION FOR SEQ ID NO: 1141 



L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
25 (B) STRAIN: BM4147 

(C) ACCESSION NUMBER: extracted from M972 97 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141 



3 0 ATGAATAACA TCGGCATTAC TGTTTATGGA TGTGAGCAGG ATGAGGCAGA 50 
TGCATTCCAT GCTCTTTCGC CTCGCTTTGG CGTTATGGCA ACGATAATTA 100 
ACGCCAACGT GTCGGAATCC AACGCCAAAT CCGCGCCTTT CAATCTUVTCT 150 
ATCAGTGTGG GACATAAATC AGAGATTTCC GCCTCTATTC TTCTTGCGCT 2 00 

GAAGAGAGCC GGTGTGAAAT ATATTTCTAC CCGAAGCATC GGCTGCAATC 250 

35 ATATAGATAC AACTGCTGCT AAGAGAATGG GCATCACTGT CGACAATGTG 300 
GCGTACTCGC CGGATAGCGT TGCCGATTAT ACTATGATGC TAATTCTTAT 350 
GGCAGTACGC AACGTAAAAT CGATTGTGCG CTCTGTGGAA AAACATGATT 4 00 

TCAGGTTGGA CAGCGACCGT GGCAAGGTAC TCAGCGACAT GACAGTTGGT 450 
GTGGTGGGAA CGGGCCAGAT AGGCAAAGCG GTTATTGAGC GGCTGCGAGG 500 

40 ATTTGGATGT AAAGTGTTGG CTTATAGTCG CAGCCGAAGT ATAGAGGTAA 550 
ACTATGTACC GTTTGATGAG TTGCTGCAT^ ATAGCGATAT CGTTACGCTT 600 
CATGTGCCGC TCAATACGGA TACGCACTAT ATTATCAGCC ACGAACAAAT 650 
ACAGAGAATG AAGCAAGGAG CATTTCTTAT CAATACTGGG CGCGGTCCAC 700 
TTGTAGATAC CTATGAGTTG GTTAAAGCAT TAGAAAACGG GAAACTGGGC 750 

45 GGTGCCGCAT TGGATGTATT GGAAGGAGAG GAAGAGTTTT TCTACTCTGA 800 

TTGCACCCAA AAACCAATTG ATAATCAATT TTTACTTAAA CTTCAAAGAA 850 
TGCCTAACGT GATAATCACA CCGCATACGG CCTATTATAC CGAGCAAGCG 900 
TTGCGTGATA CCGTTGAAAA AACCATTAAA AACTGTTTGG ATTTTGAAAG 950 

GAGACAGGAG CATGAATAGA ATAAAAGTTG CAATACTGTT TGGGGGTTGC 10 00 

50 TCAGAGGAGC ATGACGTATC GGTAAAATCT GCAATAGAGA TAGCCGCTAA 1050 

CATTAATAAA GAAAAATACG AGCCGTTATA CATTGGAATT ACGAAATCTG 1100 

GTGTATGGAA AATGTGCGAA AAACCTTGCG CGGAATGGGA AAACGACAAT 1150 

TGCTATTCAG CTGTACTCTC GCCGGATAAA AAAATGCACG GATTACTTGT 1200 

TAAAAAGAAC CATGAATATG AAATCAACCA TGTTGATGTA GCATTTTCAG 12 50 

55 CTTTGCATGG CAAGTCAGGT GAAGATGGAT CCATACAAGG TCTGTTTGAA 13 00 

TTGTCCGGTA TCCCTTTTGT AGGCTGCGAT ATTCAAAGCT CAGCAATTTG 1350 

TATGGACAAA TCGTTGACAT ACATCGTTGC GAAAAATGCT GGGATAGCTA 14 00 

CTCCCGCCTT TTGGGTTATT AATAAAGATG ATAGGCCGGT GGCAGCTACG 14 50 

TTTACCTATC CTGTTTTTGT TAAGCCGGCG CGTTCAGGCT CATCCTTCGG 1500 

60 TGTGAAAAAA GTCAATAGCG CGGACGAATT GGACTACGCA ATTGAATCGG 1550 
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CAAGACAATA TGACAGCAAA ATCTTAATTG AGCAGGCTGT TTCGGGCTGT 1600 

GAGGTCGGTT GTGCGGTATT GGGAAACAGT GCCGCGTTAG TTGTTGGCGA 1650 

GGTGGACCAA ATCAGGCTGC AGTACGGAAT CTTTCGTATT CATCAGGAAG 1700 

TCGAGCCGGA AAAAGGCTCT GAAAACGCAG TTATAACCGT TCCCGCAGAC 17 50 

CTTTCAGCAG AGGAGCGAGG ACGGATACAG GAAACGGCAA AAAAAATATA 1800 

TAAAGCGCTC GGCTGTAGAG GTCTAGCCCG TGTGGATATG TTTTTACAAG 1850 

ATAACGGCCG CATTGTACTG AACGAAGTCA ATACTCTGCC CGGTTTCACG 1900 

TCATACAGTC GTTATCCCCG TATGATGGCC GCTGCAGGTA TTGCACTTCC 1950 

CGAACTGATT GACCGCTTGA TCGTATTAGC GTTAAAGGGG TGATAAGCAT 2000 

GGAAATAGGA TTTACTTTTT TAGATGAAAT AGTACACGGT GTTCGTTGGG 2 050 

ACGCTAAATA TGCCACTTGG GATAATTTCA CCGGAAAACC GGTTGACGGT 2100 

TATGAAGTAA ATCGCATTGT AGGGACATAC GAGTTGGCTG AATCGCTTTT 2150 

GAAGGCAAAA GAACTGGCTG CTACCCAAGG GTACGGATTG CTTCTATGGG 2200 

ACGGTTACCG TCCTAAGCGT GCTGTAAACT GTTTTATGCA ATGGGCTGCA 2250 

CAGCCGGAAA ATAACCTGAC AAAGGAAAGT TATTATCCCA ATATTGACCG 23 00 

AACTGAGATG ATTTCAAAAG GATACGTGGC TTCAAAATCA AGCCATAGCC 23 50 

GCGGCAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTGAGCTT 24 00 

GTACCAATGG GGAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 2450 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 2500 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 2 550 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 2600 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 2650 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 2700 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 2750 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 2 800 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 2850 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 2900 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 2 950 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 3000 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 3050 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 3100 

GAAGCACTGT TTTCTCAGGA AAAAGTCGAA TTTCAAAATT ATGATCAAAA 3150 

TCCCAAAGAA CATTTAGAAA ATAGTGGGAC TTCTGAAAAT ACCCAAGAGA 32 00 

AAACAATTAC AGAAGAACAG GTTTATCAAG GAAATCTGCT ATTAATCAAT 3250 

AGTAAATATC CTGTTCGCCA AGAAAGTGTG AAGTCAGATA TCGTGAATTT 33 00 

ATCTAT^CAT GACGAATTAA TAAATGGATA CGGGTTGCTT GATAGTAATA 3350 

TTTATATGTC AAAAGAAATA GCACAAAAAT TTTCAGAGAT GGTCAATGAT 34 00 

GCTGTAAAGG GTGGCGTTAG TCATTTTATT ATTAATAGTG GCTATCGAGA 3450 

CTTTGATGAG CAAAGTGTGC TTTACCAAGA AATGGGGGCT GAGTATGCCT 3500 

TACCAGCAGG TTATAGTGAG CATAATTCAG GTTTATCACT AGATGTAGGA 3550 

TCAAGCTTGA CGAAAATGGA ACGAGCCCCT GAAGGAAAGT GGATAGAAGA 3 600 

AAATGCTTGG AAATACGGGT TCATTTTACG TTATCCAGAG GACAAAACAG 3650 

AGTTAACAGG AATTCAATAT GAACCATGGC ATATTCGCTA TGTTGGTTTA 3700 

CCACATAGTG CGATTATGAA AGAAAAGAAT TTCGTTCTCG AGGAATATAT 3 750 

GGATTACCTA AAAGAAGAAA AAACCATTTC TGTTAGTGTA AATGGGGAAA 38 00 

AATATGAGAT CTTTTATTAT CCTGTTACTA AAAATACCAC CATTCATGTG 3 850 

CCGACTAATC TTCGTTATGA GATATCAGGA AACAATATAG ACGGTGTAAT 3900 

TGTGACAGTG TTTCCCGGAT CAACACATAC TAATTCAAGG AGGTAA 3946 



2) INFORMATION FOR SEQ ID NO: 1142 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142 
GATCCTCTAA ATGATTCTCA GGTGG 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143 
CAATTAGCTT AGCAATAGGT GTTGG 



2) INFORMATION FOR SEQ ID NO: 1144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144 
TGTYTTCCAA GGTTCAGCTC 



2) INFORMATION FOR SEQ ID NO: 1145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145 
AACATATTKG GTTGATAGGT 



2) INFORMATION FOR SEQ ID NO: 1146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146 
GGGATTACCT ATGCCAATAT GAT 



2) INFORMATION FOR SEQ ID NO: 1147 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147 
AGCTGTGTTA GCVCGAACAT CTTG 



2) INFORMATION FOR SEQ ID NO: 1148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148 
GACTTTGTTT GGCGTGATAT 



2) INFORMATION FOR SEQ ID NO: 1149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149 
TCCYACWATT TCTTTTTGWG 



2) INFORMATION FOR SEQ ID NO: 1150 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150 

TGATAATCAC ACCGCATACG 20 



15 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 1151 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151 
TGCTGTCATA TTGTCTTGCC 20 

2) INFORMATION FOR SEQ ID NO: 1152 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1152 
ATAAAGATGA TAGGCCGGTG 20 

45 

2) INFORMATION FOR SEQ ID NO: 1153 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153 

60 CTCGTATGTC CCTACAATGC 20 
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2) INFORMATION FOR SEQ ID NO: 1154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154 
GTTTGAAGCA TATAGCCTCG 



2) INFORMATION FOR SEQ ID NO: 1155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155 
CAGTGCTTCA TTAACGTAGT C 



2 ) INFORMATION FOR SEQ ID NO: 1156 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156 
GTTGAAATGC ATCACGAACA ATT 



2) INFORMATION FOR SEQ ID NO: 1157 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157 
AAGAACGTTT CAGTTAAGGA AAT 
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2) INFORMATION FOR SEQ ID NO: 1158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158 
AAGAGGTAAT GTCTGTGGT 



2) INFORMATION FOR SEQ ID NO: 1159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159 
TGAAGGTTTG CCAGGTGA 



2) INFORMATION FOR SEQ ID NO: 1160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160 
CGTTTCTGTT AAAGAAATTA GAAG 



2) INFORMATION FOR SEQ ID NO: 1161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161 
TCCAGGTGAT AACGTTGG 18 

2) INFORMATION FOR SEQ ID NO: 1162 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162 
CAAGTCCGTG GAAATGCA 18 

25 

2) INFORMATION FOR SEQ ID NO: 1163 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163 
40 GTTGGTTTCA ACGTTAAGAA C 21 



45 



2) INFORMATION FOR SEQ ID NO: 1164 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164 

55 

GGTTTCAACG TCAAGAAC 18 



60 2 ) INFORMATION FOR SEQ ID NO: 1165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1165 
GGTTTCAACG TGAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1166 
ACGTTAAGAA TGTTTCTGTC AA 



2) INFORMATION FOR SEQ ID NO: 1167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1167 
GTTGGTTTCA ACGT 



2) INFORMATION FOR SEQ ID NO: 1168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1168 
GAACAATTGG TTGAAGGTGT 
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2) INFORMATION FOR SEQ ID NO: 1169 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: SP-665 

(C) ACCESSION NUMBER: AF139883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169 



ATGAACAAAC 
CTTT^GCTTG 
ACTACGTTAG 
ACTTCTAGTA 
TTCTGAACGC 
TTAAGGCAAT 
ATTGATTCCA 
TTCCCTCCAA 
ACTTTTCAAC 
GCTTGGTTAG 
GACCTACTAT 
AGACAGCAGC 
CCTCAGTTAG 
CCCCTATTCA 
CTGAAATGAA 
GTCAATACAC 
TTACCCTGCT 
AACAAGAAAC 
AATGTAGACC 
TCAATACGTC 
TAGATGTTTC 
GCAAGTAACG 
CTGGGGTTCT 
ACGGTGTTTA 
TATCCGGGAA 
TAATATTACT 
TTGAGACTTT 
GGTCTTGGTA 
TAATACAACA 
CTGCTGCTTA 
TATATCCATA 
TGTCGGAACT 
TGATGAAAAC 
TGGCTTCCTC 
AGTTGAAAAC 
TTGTTGGTTA 
AATCGTTTAA 
TTATCGCTCA 
GGACGATGCC 
GGAGCTCGCC 



CAACGATTCT 
GTTATCGCAG 
CAAGGCTCCT 
AAATCTACGA 
CGCGTCAATG 
CGTTTCTATC 
TCCGTATCCT 
GGTGGATCAA 
CTCGACTTCC 
CGATTCAGTT 
ATAAATAAGG 
TCAAAACTAC 
CCTTGCTGGC 
CATCCAGAAG 
AAATCAAGGT 
CAATTACTGA 
TACATGGATA 
TGGCTATAAC 
AAGAAGCTCA 
TCTTACCCTG 
AAATGGTAAA 
TTTCATTTGG 
GCTATGAAAC 
TGATTCCACT 
CAAGCACACC 
CTGCAATATG 
GAATAAGGTC 
TCGACTATCC 
GAATCTAATA 
TGCTGCCTTT 
AAGTCGTCTT 
CGTGCCATGA 
AGTCTTGACT 
AAGCTGGTAA 
CACATCAAGA 
TACTCGTAAG 
CTCCTATCGT 
ATGATAACGT 
AGACGGACTT 
CAATATGGAC 



GCGCCTAATC 
CCATTGTCTT 
AGCCTATCCG 
CAATAAAAAT 
CCCAAGCTAA 
GAAGACCATC 
GGGAGCTTTC 
CTCTCACCCA 
GACCAGACTA 
AGAACAAAAA 
TCTACATGTC 
TATGGTAAAG 
TGGAATGCCT 
CAGCCCAAGA 
TACATCTCTG 
TGGACTACAA 
ATTACCTCAA 
CTTCTAACTA 
AAAACATCTG 
ACGATGATTT 
GTCATCGCCC 
TACCAACCAA 
CAATCACCGA 
GCAACTATGG 
TGTCTACAAC 
CTCTTCAACA 
GGTCTAGATA 
/yVGCATGCAT 
AACAATACGG 
GCAAATGGTG 
CAGTGATGGA 
AGGAAACGAC 
TATGGAACTG 
AACAGGAACC 
ACACTGGCTA 
TATTCTATGG 
TGGAGATGGT 
ATCTATCAGA 
TTCAGAAACG 
TGAACCCTCT 



AAGTATCTGA 
AGGCGGAGGA 
AGAGTAAACT 
CAACTCATTG 
TGATATTCCC 
GCTTCTTCGA 
TTGCGCAATC 
ACAGTTGATT 
TTTCTCGTAA 
GCAACCAAAC 
TAATGGCAAC 
ACCTCAATAA 
CAGGCACCAA 
CCGCCGAAAC 
CTGAACAGTA 
AGTCTCAT^VT 
GGAGGTCATC 
CTGGGATGGA 
TGGGATATCT 
GCAAGTCGCA 
AACTTGGAGC 
GCTGTGGAAA 
TTATGCACCT 
TTAATGATAT 
TGGGATAGAG 
ATCACGAAAT 
GAGCTAAAAC 
TATGCAAACG 
AGCAAGTAGT 
GCACTTACTA 
AGTAAAAAAG 
AGCCTATATG 
GGCGTGGAGC 
TCTAACTATA 
TGTAGCTCCA 
CTGTATGGAC 
TTCCTAGTTG 
AGATACTCAT 
GGGAATTTGT 
ACTCAACAAT 



GCATTAGCTT 


50 


GTTTTTTTCT 


100 


AGTTGCAACA 


150 


CTGACTTGGG 


200 
^ \j \j 


ACAGATTTGG 


250 
^ ^ \j 


CCACAGGGGG 


300 


TGCAAAGTAA 


350 
•J >j \j 


AAGTTGACTT 


4 00 
^ \j \j 


GGCTCAGGAA 


450 


AGGAAATCTT 


500 


TATGGAATGC 


550 


TTTAAGTTTA 


600 


ACCAATATGA 


650 


TTGGTCTTAT 


700 


TGAGAAAGCA 


750 


CAGCAAGTAA 


800 


AATCAAGTAG 


850 


TGTTTACACA 


900 


ACAACTCCGA 


950 


TCTACGGTCG 


1000 


TCGTCACCAA 


1050 


CCAATCGTGA 


1100 


GCCATAGAAT 


1150 


TCCTTATAAC 


1200 


CATATTTCGG 


1250 


GTCACAGCCG 


1300 


CTTCCTTAAT 


1350 


CCATTTCAAG 


1400 


GAAAAAATGG 


1450 


TAAACCAATG 


1500 


AGTTCTCTAA 


1550 


ATGACCGACA 


1600 


CTATCTTCCT 


1650 


CAGATGAGGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTCAAAAAT 


1950 


CCTCAACAGC 


2000 
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TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 
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TCTAGCTCAA 2 050 

CAATAAT ACQ 2100 
CTCAACCAGC 2150 
2160 



2) INFORMATION FOR SEQ ID NO: 1170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170 
ACGAATTGGA CTACGCAATT 



2) INFORMATION FOR SEQ ID NO: 1171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171 
ACGAGGATGA TTTGATTGTC 



2) INFORMATION FOR SEQ ID NO: 1172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pnexjmonia.e 

(B) STRAIN: 64147 

(C) ACCESSION NUMBER: XI 3 136 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172 

GATCCTCTAA ATGATTCTCA GGTGGCTGTT ATTGCCTCTA TTTCAAAGGA 50 

GATGCCTGGC ATTAGTATTT CTACTTCTTG GGATAGA/AG GTTTTGGAAA 100 

CTTCCCTTTC TTCTATAGTT GGGAGTGTAT CCAGTGAAAA AGCTGGTCTC 150 

CCAGCGGAAG AAGCAGAAGC CTATCTTAAA AAAGGCTATT CTCTAAATGA 200 
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CCGTGTAGGA ACCTCCTATT TGGAAAAGCA 
GAAAACGCTC GGTAAAAGAA ATCCATCTGG 
AGCGTGGATA CAATTGAGGA AGGTAGTAAG 
CATTGATTTG GCCTTCCAAG ATAGCGTGGA 
TCAATTCAGA GTTGGGAAAT GGTGGAGCCA 
GCAGTCGCCC TTAACCCAAA AACAGGTGCT 
TAAACATGAC TTGAAAACAG GAGAGTTGAC 
TAACCAATGT CTTTGTCCCA GGTTCGGTTG 
TCTGGTTGGG AAAATGGAGT CTTGTCAGGA 
GTCCATTGTC TTTCAAGGTT CAGCTCCAAT 
TTTCTAGACC AATGCCGATT ACGGCGGTTC 
AATGCTTATA TGGTCCAAAC AGCCCTAGGT 
ACCCAATATG TTTGTCGGCA CCAGCAATCT 
TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 
CTACCAGATG AATCTACTGG ATTTGTTCCC 
TTTCATTACC AATGCCTTTG GGCAGTTTGA 
TGGCTCAGTA TGTAGCAACT ATTGCAAATG 
CGTATTGTTG AAGGCATTTA TGGTAATAAT 
CTTGATTCAG CAACTGCAAC CGACAGAGAT 
ACTCCGATAT GAGCATCTTG CACCAAGGTT 
ACTAGTGAAT TGACAACTGG ACGTGCCTTT 
CATTAGCGGA AAAACAGGTA CAGCCGAAAG 
AAGCAACCAA TACCAATGCG GTGGCCTATG 
ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 
CGGACCTTCC ATTGCGCGCG ATATTATCAA 
CAATGAATTA GAAAGGAACA TATGCTTTAT 
AATTGACAGT TATTCGAAGT TACCGGGTAT 
GTTTGGCCTT 
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ATATGAAGAG ACCTTACAAG 2 50 

ACAAATATGG CAACATGGAA 300 

GGAAACAATA TCAAGCTGAC 350 

TGCTTTGCTG AAAAGTTATT 400 

AGTATTCTGA AGGTGTCTAT 450 

GTTTTGTCTA TGTCAGGAAT 500 

GCCGGATTCC TTGGGT^CGG 550 

TCAAAGCAGC GACCATCAGC 600 

AATCAGACCT TGACAGACCA 650 

TAATTCTTGG TATCCTGCCT 700 

AGGCTCTAGA GTATTCATCC 750 

CTTATGGGGC AGACCTATCA 800 

AGAGTCTGCT ATGGGGAAAT 850 

GTTCTGCGAC CGGAATTGAC 900 

AAAGAGTATA GCTTTGCTAA 950 

TAACTATACG CCGATGCAGT 1000 

ATGGTGTTCG TGTGGCTCCT 1050 

GATAAGGGAG GACTGGGTGA 1100 

GAATAAGGTC AATATATCCG 1150 

TTTATCAGGT TGCCCATGGT 12 00 

TCAAATGGCG CCTTGGTATC 12 50 

CTATGTGGCA GATGGTCAGC 13 00 

CCCCATCTGA TAATCCCCAA 1350 

ACCAACCTTA CAAATGGTGT 1400 

CCTCTATAAC CAACATCATC 1450 

CCAACACCTA TTGCTAAGCT 1500 

CGGGATTAAG ACGGCTACCC 1550 

1560 



2 ) INFORMATION FOR SEQ ID NO: 1173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 007 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

(C) ACCESSION NUMBER: 24 9096 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1173 

GAGTCCCGAT TGCTGAGGAT GCAACCTCCT ATAATGTCTA TGCGGTCATT 50 

GATGAGAACT ATAAGTCAGC AACGGGTAAG ATTCTTTACG TAGAAAAAAC 100 

ACAATTTAAC AAGGTTGCAG AGGTCTTTCA TAAGTATCTG GACATGGAAG 150 

AATCCTATGT AAGAGAGCAA CTCTCGCAAC CTAATCTCAA GCAAGTTTCC 200 

TTTGGAGCAA AGGGAAATGG GATTACCTAT GCCAATATGA TGACTATCAA 250 

AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 3 00 

CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGTCTA 350 

GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTGC TGGGAACTTC 400 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGCATTA 450 

TTACCTATGA Ai\AGGATCGT CTGGGCAATA TTGTACCCGG AACAGAACTG 500 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 550 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 600 
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TAAAAGGTAA GTATATGACC 
ATTCTCGCTA CCACCCAACG 
AATCACTGAG GACTTTGTTT 
AACCAGGATC AGCCTTTAAG 
5 AATACCTTCC CAAGTGGAGA 
GGATGCGACG ACTCGAGATT 
GGATGATGAC TTTCTCACAA 
AGTCTACTTG AACAAAAAAT 
ACGCTTTAAA TTTGGGGTTC 

10 CTGGTCAACT TCCAGCTGAT 
GGGCAAGGAA TTTCAGTGAC 
TATTGCTAAT GATGGAGTTA 
ATGATACTAA CAATCAGTCT 
AATCCTGTTT CCAAAGAGGC 

15 AGTTGGGACG GACCCTCTAT 
AGCCAATTAT AACAGTTCCT 
GCTCAAATCG CTGATGAGAA 
TTATATTTTC TCAGTTGTGA 
TCTTGTATGT AACGGTTCAA 

20 GGAGAATTTG CCACCCCAAT 
TCTCAATCTT CAATCTCCAG 
CTTCTTATGC AATGCCTAGC 
GAAGCCTTAC GCCGAAATAT 
AAAGATTAAA GAGACTTCTG 

25 AACAAGTTCT CCTTTTATCG 
AGCTGGAAAA AAGAGACTGC 
ACTGGAATTT GAAGGTTCAG 
CTAATACAGC TATCAAAAAC 
TAATATG 

30 



GCGACCTTGG TCAGTGCAAA 
ACCTACCTTT AATGCAGATA 
GGCGTGATAT TCTTTATCAA 
GTCATGATGT TAGCTTCTTC 
ATACTTCAAT AGCAGTGAAT 
GGGATGTTAA TGAGGGTTTG 
GGTTTCGCTC ACTCCAGTAA 
GGGAGATGCT ACTTGGTTGG 
CAACTCGCTT TGGCTTGACA 
AATATTGTTA GTATTGCTCA 
ACAAACACAA ATGCTTCGTG 
TGCTGGAGCC AAAATTTATA 
GTACGTAAGT CACAAAAAGA 
AGCAAGCACA ACTCGAAATC 
ATGGAACTAT GTATAATCAC 
GGACAAAATG TAGCAGTTAA 
AAATGGAGGA TACTTGGTTG 
CTATGAATCC TGCTGAAAAT 
CAGCCTGAGC ATTATTCAGG 
CTTGGAGCGG GCTTCAGCTA 
CCAAAAATTT AGATAAAGTT 
ATCAAGGATA TTTCACCTGG 
TGTGCAACCA ATCGTTGTAG 
TAGAAGAAGG GACCAATCTT 
GATAAGGTAG AAGAAATTCC 
CGAGACCTTT GCTAAATGGT 
GTTCCGTTGT TCAGAAGCAA 
ATTAAAAAAA TTAAATTAAC 
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GACCGGTGAA 


650 


CTAAAGAAGG 


700 


AGTAACTATG 


750 


TATTGATAAT 


800 


TCAAAATAGC 


850 


ACTACTGGTG 


900 


TGTTGGAACG 


950 


ATTATCTAAA 


1000 


GATGAATACG 


1050 


AAGCTCATTT 


1100 


CCTTTACAGC 


1150 


AGTGCTATTT 


1200 


AATAGTAGGA 


1250 


ACATGATCTT 


1300 


TACACAGGAA 


1350 


ATCCGGTACG 


1400 


GTTCTACCAA 


1450 


CCTGATTTTA 


1500 


TATCCAGTTG 


1550 


TGAAAGAATC 


1600 


ACGACAGAAT 


1650 


TGAGTTGGCG 


1700 


GTACTGGAAC 


1750 


GCACCAAACC 


1800 


AGACATGTAT 


1850 


TGGATATTGA 


1900 


GATGTTCGGA 


1950 


TTTAGGAGAC 


2000 




2007 



2 ) INFORMATION FOR SEQ ID NO: 1174 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174 
45 GAACGTG6TG AAGTTCGC 18 



50 



2 ) INFORMATION FOR SEQ ID NO: 1175 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175 

60 
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GTTACTGGTG TAGAAATGTT C 21 



5 2) INFORMATION FOR SEQ ID NO: 1176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176 

TACTGGTGTA GAAATGTTC 19 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1177 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177 
GCTCAACAAG TTCCAGATTA 20 

2) INFORMATION FOR SEQ ID NO : 1178 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: NCTC832 5 

50 (C) ACCESSION NUMBER: X52593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1178 

ATGAACTGAT TATACTTAAC ATTAAAAAAG ATGATAACAC CTTCTACACC 50 

55 TCCATATCAC AAAAAATTAT AACATTATTT TGACATAAAT ACTACATTTG 100 

TAATATACTA CAAATGTAGT CTTATATAAG GAGGATATTG ATQAAAAAGA 150 

TAAAAATTGT TCCACTTATT TTAATAGTTG TAGTTGTCGG GTTTGGTATA 200 

TATTTTTATG CTTCAAAAGA TAAAGAAATT AATAATACTA TTGATGCAAT 250 

TGAAGATAAA AATTTCAAAC AAGTTTATAA AGATAGCAGT TATATTTCTA 3 00 

60 AAAGCGATAA TGGTGAAGTA GAAATGACTG AACGTCCGAT AAAAATATAT 350 
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AATAGTTTAG GCGTTAAAGA 
AGTATCTAAA AATAAAAAAC 
ACTACGGTAA CATTGATCGC 
GGTATGTGGA AGTTAGATTG 
GAAAGACCAA AGCATACATA 
TTTTAGACCG AAACAATGTG 
TTAGGCATCG TTCCAAAGAA 
TAAAGAACTA AGTATTTCTG 
TTGGGTACAA GATGATACCT 
GATGAATATT TAAGTGATTT 
AACAGAAAGT CGTAACTATC 
GTTATGTTGG TCCCATTAAC 
GGCTATAAAG ATGATGCAGT 
CGATAAAAAG CTCCAACATG 
ATAATAGCAA TACAATCGCA 
GGCAAAGATA TTCAACTAAC 
TAACAACATG AAAAATGATT 
CAGGTGAATT ATTAGCACTT 
TTTATGTATG GCATGAGTAA 
Ai^AAGAACCT CTGCTCAACA 
CTCAAAAAAT ATTAACAGCA 
GATAAT^CAA GTTATAAAAT 
TTGGGGTGGT TACAACGTTA 
ACTTT^AAACA AGCAATAGAA 
GCACTCGAAT TAGGCAGTAA 
TGTTGGTGAA GATATACCAA 
CAAACAAAAA TTTAGATAAT 
CAAGGTGAAA TACTGATTAA 
ATTAGAAAAT AATGGCAATA 
AAAACAAAGT TTGGAAGAAA 
TTAAATGATG GTATGCAACA 
TTATAGATCT TATGCAAACT 
AAATGAAACA AGGAGAAAGT 
GATAAAGATA ATCCAAACAT 
AGATAAAGGA ATGGCTAGCT 
ATGAGCTATA TGAGAACGGT 
AACAGTGAAG CAATCCGTAA 
TATTAATAAG TGCTGTTACT 
TTGTATGTTG AAAGTGACAC 
TTTCTTATTT GTAATTTCAG 
AGTTTAATAA ATTTAACGTT 
TCCGTATTTA CCTTCTTCTA 
GCATGC 



TATAAACATT CAGGATCGTA 
GAGTAGATGC TCAATATAAA 
AACGTTCAAT TTAATTTTGT 
GGATCATAGC GTCATTATTC 
TTGAA/^TTT AAAATCAGAA 
GAATTGGCCA ATACAGGAAC 
TGTATCTAAA AAAGATTATA 
AAGACTATAT CAACAACAAA 
TCGTTCCACT TTAAAACCGT 
CGCAAAAAAA TTTCATCtTA 
CTCTAGAAAA AGCGACTTCA 
TCTGAAGAAT TAAAACAAAA 
TATTGGTAAA AAGGGACTCG 
AAGATGGCTA TCGTGTCACA 
CATACATTAA TAGAGAAAAA 
TATTGATGCT AAAGTTCAAA 
ATGGCTCAGG TACTGCTATC 
GTAAGCACAC CTTCATATGA 
CGAAGAATAT AATAAATTAA 
AGTTCCAGAT TACAACTTCA 
ATGATTGGGT TAAATAACAA 
CGATGGTAAA GGTTGGCAAA 
CAAGATATGA AGTGGTAAAT 
TCATCAGATA ACATTTTCTT 
GAAATTTGAA AAAGGCATGA 
GTGATTATCC ATTTTATAAT 
GAAATATTAT TAGCTGATTC 
CCCAGTACAG ATCCTTTCAA 
TTAACGCACC TCACTTATTA 
AATATTATTT CCAAAGAAAA 
AGTGGTAAAT AAAACACATA 
TAATTGGCAA ATCCGGTACT 
GGCAGACAAA TTGGGTGGTT 
GATGATGGCT ATTAATGTTA 
ACAATGCCAA AATCTCAGGT 
AATAAAAAAT ACGATATAGA 
CGATGGTTGC TTCACTGTTT 
TCTCCCTTAA ATACAATTTC 
TGTAACGAGT CCATTTTCTT 
CGATAACGTA CAATGTATTA 
ATTCATTTGT GTTCCTGCTA 
CCCATAATTT AAATGATATT 
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AAATAATU^UVA 


400 


ATTAAAACAA 


450 


TAAAGAAGAT 


500 


CAGGAATGCA 


550 


CGTGGTAAAA 


600 


ACATATGAGA 


650 


AAGCAATCGC 


700 


TGGATCAAAA 


750 


TAAAAAAATG 


800 


CAACTAATGA 


850 


CATCTATTAG 


900 


AGAATATAAA 


950 


AAAAACTTTA 


1000 


ATCGTTGACG 


1050 


GAAAAAAGAT 


1100 


AGAGTATTTA 


1150 


CACCCTCAAA 


1200 


CGTCTATCCA 


1250 


CCGAAGATAA 


1300 


CCAGGTTCAA 


1350 


AACATTAGAC 


1400 


AAGATAAATC 


1450 


GGTAATATCG 


1500 


TGCTAGAGTA 


1550 


AAAAACTAGG 


1600 


GCTCAAATTT 


1650 


AGGTTACGGA 


1700 


TCTATAGCGC 


1750 


AAAGACACGA 


1800 


TATCAATCTA 


1850 


AAGAAGATAT 


1900 


GCAGAACTCA 


1950 


TATATCATAT 


2000 


AAGATGTACA 


2050 


AAAGTGTATG 


2100 


TGAATAACAA 


2150 


TATTATGAAT 


2200 


TTCATTTTCA 


2250 


TTTTTATGGA 


2300 


CCTGGTATAC 


2350 


CAACTTCTTC 


2400 


GAAAGTGTAT 


2450 




2456 



2) INFORMATION FOR SEQ ID NO: 1179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179 
ATTTGGTGAC GGGTGACTTT 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO: 1180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180 
15 TCCACCGTTG CCAATCGCA 19 



2) INFORMATION FOR SEQ ID NO: 1181 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1181 

30 

AGCAGCTTAC TAGATGCCGT 20 



35 2) INFORMATION FOR SEQ ID NO: 1182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182 

AACTGCAAGA GATCCTTTGG 20 



50 



60 



2) INFORMATION FOR SEQ ID NO: 1183 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 53 5 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 175 

(C) ACCESSION NUMBER: M18729 



(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1183 



ATGGCGATAG AAAAGCTATC ACCCGGCATG CAACAGTATG TGGATATTAA 50 

10 AAAGCAATAT CCAGATGCTT TTTTGCTCTT TCGGATGGGT GATTTTTATG 100 

AATTATTTTA TGAGGATGCG GTCAATGCTG CGCAGATTCT GGAAATTTCC 150 

TTAACGAGTC GCAACAAGAA TGCCGACAAT CCGATCCCTA TGGCGGGTGT 200 

TCCCTATCAT TCTGCCCAAC AGTATATCGA TGTCTTGATT GAGCAGGGTT 250 

ATAAGGTGGC TATCGCAGAG CAGATGGAAG ATCCTAAACA AGCAGTTGGG 300 

15 GTTGTTAAAC GAGAGGTTGT TCAGGTCATT ACGCCAGGGA CAGTGGTCGA 3 50 

TAGCAGTAAG CCGGACAGTC AGAATAATTT TTTGGTTTCC ATAGACCGCG 400 

AAGGCAATCA ATTTGGCCTA GCTTATATGG ATTTGGTGAC GGGTGACTTT 450 

TATGTGACAG GTCTTTTGGA TTTCACGCTG GTTTGTGGGG T^AATCCGTAA 500 

CCTCAAGGCT CGAGAAGTGG TGTTGGGTTA TGACTTGTCT GAGGAAGAAG 550 

20 AACAAATCCT CAGCCGCCAG ATGAATCTGG TACTCTCTTA TGAAAAAGAA 600 

AGCTTTGAAG ACCTTCATTT ATTGGATTTG CGATTGGCAA CGGTGGAGCA 650 

AACGGCATCT AGTAAGCTGC TCCAGTATGT TCATCGGACT CAGATGAGGG 700 

AATTGAACCA CCTCAAACCT GTTATCCGCT ACGAAATTAA GGATTTCTTG 750 

CAGATGGATT ATGCGACCAA GGCTAGTCTG GATTTGGTTG AGAATGCTCG 800 

25 CTCAGGTAAG AAACAAGGCA GTCTTTTCTG GCTTTTGGAT GAAACCAAAA 850 

CGGCTATGGG GATGCGTCTC TTGCGTTCTT GGATTCATCG CCCCTTGATT 900 

GATAAGGAAC GAATCGTCCA ACGTCAAGAA GTAGTGCAGG TCTTTCTCGA 950 

CCATTTCTTT GAGCGTAGTG ACTTGACAGA CAGTCTCAAG GGTGTTTATG 1000 

ACATTGAGCG CTTGGCTAGT CGTGTTTCTT TTGGCAAAAC CAATCCAAAG 1050 

30 GATCTCTTGC AGTTGGCGAC TACCTTGTCT AGTGTGCCAC GGATTCGTGC 1100 

GATTTTAGAA GGGATGGAGC AACCTACTCT AGCCTATCTC ATCGCACAAC 1150 

TGGATGCAAT CCCTGAGTTG GAGAGTTTGA TTAGCGCAGC GATTGCTCCT 1200 

GAAGCTCCTC ATGTGATTAC AGATGGGGGA ATTATCCGGA CTGGATTTGA 12 50 

TGAGACTTTA GACAAGTATC GTTGCGTTCT CAGAGAAGGG ACTAGCTGGA 13 00 

35 TTGCTGAGAT TGAGGCTAAG GAGCGAGAAA ACTCTGGTAT CAGCACGCTC 1350 

AAGATTGACT ACAATAA7VAA GGATGGCTAC TATTTTCATG TGACCAATTC 14 00 

GCAACTGGGA AATGTGCCAG CCCACTTTTT CCGCAAGGCG ACGCTGAAAA 14 50 

ACTCAGAACG CTTTGGAACC GAAGAATTAG CCCGTATCGA GGGAGATATG 150 0 

CTTGAGGCGC GTGAGAAGTC AGCCAACCTC GAATACGAAA TATTTATGCG 1550 

40 CATTCGTGAA GAGGTCGGCA AGTACATCCA GCGTTTACAA GCTCTAGCCC 1600 

AAGGAATTGC GACGGTTGAT GTCTTACAGA GTCTGGCGGT TGTGGCTGAA 16 50 

ACCCAGCATT TGATTCGACC TGAGTTTGGT GACGATTCAC AAATTGATAT 1700 

CCGGAAAGGG CGCCATGCTG TCGTTGAAAA GGTTATGGGG GCTCAGACCT 1750 

ATATTCCAAA TACGATTCAG ATGGCAGAAG ATACCAGTAT TCAATTGGTT 18 00 

45 ACAGGGCCAA ACATGAGTGG GAAGTCTACC TATATGCGTC AGTTAGCCAT 1850 

GACGGCGGTT ATGGCCCAGC TGGGTTCCTA TGTTCCTGCT GAAAGCGCCC 1900 

ATTTACCGAT TTTTGATGCG ATTTTTACCC GTATCGGAGC AGCAGATGAC 1950 

TTGGTTTCGG GTCAGTCAAC CTTTATGGTG GAGATGATGG AGGCCAATAA 2000 

TGCCATTTCG CATGCGACCA AGAACTCTCT CATTCTCTTT GATGAATTGG 2 050 

50 GACGTGGAAC TGCAACTTAT GACGGGATGG CTCTTGCTCA GTCCATCATC 2100 

GAATATATCC ATGAGCACAT CGGAGCTAAG ACCCTCTTTG CGACCCACTA 2150 

CCATGAGTTG ACTAGTCTGG AGTCTAGTTT ACAACACTTG GTCAATGTCC 22 00 

ACGTGGCAAC TTTGGAGCAG GATGGGCAGG TCACCTTCCT TCACAAGATT 2250 

GAACCGGGAC CAGCTGATAA ATCCTACGGT ATCCATGTTG CCAAGATTGC 23 00 

55 TGGCTTGCCA GCAGACCTTT TAGCAAGGGC GGATAAGATT TTGACTCAGC 2350 

TAGAGAATCA AGGAACAGAG AGTCCTCCTC CCATGAGACA AACTAGTGCT 24 00 

GTCACTGAAC AGATTTCACT CTTTGATAGG GCAGAAGAGC ATCCTATCCT 2450 

AGCAGAATTA GCTAAACTGG ATGTGTATAA TATGACACCT ATGCAGGTTA 2500 

TGAATGTCTT AGTAGAGTTA AAACAGAAAC TATAA 2535 



60 
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2) INFORMATION FOR SEQ ID NO: 1184 

(i) SEQUENCE CHT^J^CTERISTICS : 
5 (A) LENGTH: 62 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 



15 



(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1184 



TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

20 GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAA?^ AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 2 00 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

25 GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

3 0 AAACCAATCC AAAGGATCTC TTG 623 



2) INFORMATION FOR SEQ ID NO: 1185 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1185 

50 TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 2 50 

55 GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 3 50 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

60 CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 
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CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 
AAACCAATCC AAAGGATCTC T 621 



2) INFORMATION FOR SEQ ID NO: 1186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1186 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGA/y^TCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 6 00 

AAACCAATCC AAAGGATCTC TT 622 



2) INFORMATION FOR SEQ ID NO: 1187 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1187 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 2 00 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCGGT ATGTTCATCG 2 50 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 
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GTTGAGAATG CTCGCTCAGG TAAGAAACAA 
GGATGAAACC AAAACGGCTA TGGGGATGCG 
ATCGCCCCTT GATTGATAAG GAACGAATCG 
CAGGTCTTTC TCGACCATTT CTTTGAGCGT 
5 CAAGGGTGTT TATGACATTG AGCGCTTGGC 
AAACCAATCC AAAGGATCTC TT 



PCT/CAOO/01150 

GGCAGTCTTT TCTGGCTTTT 4 00 

TCTCTTGCGT TCTTGGATTC 450 
TCCAACGTCA AGAAGTAGTG 500 
AGTGACTTGA CAGACAGTCT 550 
TAGTCGTGTT TCTTTTGGCA 600 

622 



10 2 ) INFORMATION FOR SEQ ID NO: 1188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oretlis 

(B) STRAIN: ATCC 35037 



25 



(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1188 



GGGTGACTTT TATGTAACGG GGCTATTGGA TTTCACGTTG GTTTGTGGGG 50 

AAATTCGCAA TCTCAAGGCT AGAGAAGTGG TGCTGGGTTA TGACTTGTCT 100 

GAGGAAGT^G AACAAATCCT CAGTCGTCAG ATGAATCTGG TGCTTTCTTA 150 

TGAGAAGGAA GGCTTTGAGG ACCTTCATTT ACTGGATCCA CGACTGGCAG 2 00 

3 0 CTGTGGAGCA AGCGGCAGCT AGTAAGCTCC TCCAGTATGT TCACCGGACC 2 50 

CAGATGCGGG AATTGAACCA CCTCAAACCA GTTATCCGCT ATGAAATCAA 300 

AGATTTCTTA CAGATGGACT ATGCGACCAA GGCTAGTCTG GATTTGGTTG 350 

AGAATGCCCG TTCAGGCAAG AAGCAAGGCA GTCTTTTCTG GCTTTTAGAT 4 00 

GAAACCAAGA CGGCTATGGG AATGCGTCTC TTGCGTTCTT GGATTCATCG 450 

35 TCCTTTGATT GATAAGGAGC GAATCGTCCA GCGTCT^GAG GTGGTGCAGG 500 

TCTTTCTTGA CCACTTCTTT GAGCGTAGTG ATTTAACGGA CAGTCTTAAG 550 

GGTGTTTATG ATATCGAACG CTTGGCTAGT CGGGTTTCTT TTGGCAAGA 599 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1189 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: StJTGptococcus mitis 

(B) STRAIN: ATCC 49456 

55 (x) SEQUENCE DESCRIPTION: SEQ ID NO : 118 9 

GGTGACGGGT GACTTTTATG TGACAGGTCT TTTGGATTTT ACGCTGGTTT 50 

GTGGGGAAAT CCGCAATCTC AAGGCTCGAG AAGTGGTGCT GGGTTATGAC 100 

TTGTCTGAGG AAGAAGAACA GATCCTTAGT CGTCAGATGA ATCTGGTACT 150 

60 TTCCTATGAA AAAGAAGGCT TTGAAGACCT TCATTTACTG GATTCACGAT 200 

650 
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TGGCAGCTGT GGAGCAAGCG 
CGGACTCAGA TGAGGGAATT 
AATCi\AAGAT TTTTTGCAGA 
TGGTTGAGAA TGCCCGTTCA 
5 TTGGATGAAA CCAAAACAGC 
TCATCGCCCC CTGATTGATA 
TGCAGGTCTT TCTCGACCAT 
CTCAAGGGTG TTTATGACAT 
CAAAACCAAT CCAAAGGATC 



GCATCTAGTA AACTGCTTCA 
GAACCACCTC AAGCCTGTTA 
TGGATTATGC GACCAAGGCT 
GGCAAGAAGC AAGGTAGTCT 
TATGGGAATG CGTCTCTTGC 
AGGAACGAAT TGTCCAACGC 
TTCTTTGAGC GTAGTGATTT 
TGAGCGCTTG GCTAGTCGTG 
TCTT 
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GTATGTTCAT 250 

TCCGCTATGA 300 

AGTCTGGATT 350 

TTTTTGGCTT 400 

GGTCTTGGAT 450 

CAAGAAGTTG 500 

GACAGACAGT 550 

TTTCTTTTGG 600 

624 



2) INFORMATION FOR SEQ ID NO: 1190 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 
25 (B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGT CTTGGTCTGC 50 

30 GGGGAAATCC GCAATTTGAA AGCTAGGGAA GTGGTGCTGG GCTATGCCTT 100 

GCCAGAAGCT GAGGAGCAGG TTTTGGCTGG ACAGATGAAC CTTTTACTGT 150 

CCTATGTGGA GAAGGTTTTG GAGGATGTTC AGCTGCTGGG CGAGGAGCTG 200 

TCTCCTATGG AGCGTCAGGC AGCAGGGAAA CTGCTGGAGT ATGTGCACCG 250 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCTCAG CATTATGAAA 300 

35 TCAAGGACTT CCTGCAAATG GACTATGCCA CCAAGGCGAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG CAAGAAGCAC GGCAGTCTTT ATTGGCTGAT 400 

GGACGAGACT AAGACGGCCA TGGGCGGCCG CATGCTGCGC TCTTGGATCC 4 50 

AGCGTCCGCT GATTGATGAA GCGCGAATTA GCCAGCGACA GAATGTCGTT 500 

GAGGTTTTTC TGGATCATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

40 CAAGGGGGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGC 599 



2) INFORMATION FOR SEQ ID NO: 1191 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Streptococcus witis 

(B) STRAIN: ATCC 903 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1191 

60 TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGC CCTGGTCTGC 50 
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GGGGAAATCC GTAATTTGAA 
GCCAGAAGCT GAGGAGCAGG 
CCTATGTACA GACGGCCTTG 
TCTCCTATGG AGCGTCAGGC 
5 GACCCAGATG AGGGAGCTCA 
TCAAGGACTT TCTGCAAATG 
ACAGAAAATG CTCGCTCGGG 
GGACGAGACC AAGACGGCCA 
AGCGTCCGTT GATTGATGAA 
10 GAGGTTTTTC TGGAACATTT 
CAAGGGAGTC TATGATATCG 
AGACCAATCC AAAGGATCTC 



GGCTAGGGAA GTGGTGCTGG 
TCTTGGCTGG ACAGATGAAT 
GACGATGTCC AGCTGCTGGG 
AGCGGGGAAA TTGCTAGAGT 
GCCATTTGAA GAAGGCCCAG 
GATTATGCTA CCAAGGCGAG 
TAAGAAACAC GGCAGTCTTT 
TGGGCGGCCG TATGCTGCGC 
GTGCGAATTA GCCAGCGGCA 
CTTTGAGCGG AGTGATTTGA 
AGCGGCTGGC TAGTCGGGTG 
TT 



GCTATGCTTT 100 

CTTTTGCTGT 150 

CGAGGAACTG 200 

ATGTGCACCG 2 50 

CATTATGAAA 3 00 

TCTGGATTTG 350 

ATTGGCTGAT 4 00 

TCTTGGATCC 450 

GAATGTCGTC 500 

CGGAGAGCCT 550 

TCTTTTGGCA 600 

622 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1192 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1192 
GGTAAAACAG GAACCTCTAA CT 22 

2) INFORMATION FOR SEQ ID NO: 1193 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
{ D ) TOPOLOGY : Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193 

GGTAAGACAG GTACTTCTAA CT 22 

45 



2) INFORMATION FOR SEQ ID NO: 1194 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194 

60 
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CATTTCAAGT AATACAACAG AATC 



PCT/CAOO/01150 

24 



5 2) INFORMATION FOR SEQ ID NO: 1195 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID .NO: 1195 

CATTTCAAGT AACACAACTG AATC 24 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1196 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196 
GCCATTTCAA GTAATACAAC AGAA 24 

2 ) INFORMATION FOR SEQ ID NO: 1197 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197 
CAAACGCCAT TTCAAGTAAT ACAAC 25 

50 



2) INFORMATION FOR SEQ ID NO: 1198 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus Sctprophyticus 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198 



2 ) INFORMATION FOR SEQ ID NO: 1199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199 
GTATTAAAGA AGATATCCAA AAAGC 



2) INFORMATION FOR SEQ ID NO: 1200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1200 
TCA/y^GAAGA AACTAAT^AAA GCTGT 



2) INFORMATION FOR SEQ ID NO: 1201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRT^NDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



AACGGGCGTC TCGATAGAAA 
ATCAATATAC GATTTTGCTG 
CAAATAAAAA TTAATAGGCA 
GATTCTGGTT TTTTAAATTC 
TTTAAAGTAT AGGTATGAAA 
ATCATGAAAC AATTAACTAA 
TACAACAACG CTGATTGGCG 
AACATCCTAT TGATTTTGTG 



AACACGTGAA 
AAGCTGTAAC 
ACTTAACCAG 
AAAAAGTTTT 
TACAATTGAT 
GCCTTTATAC 
CGTTACTATT 
GACGCCCGTT 



AATCCCAATG ATTATAAACA 
AAAAGGTATT AAGGAACAAA 
AATCGTTAAA ACTATATGAC 
CTAAAAAATT TACTTGCTTC 
TAAAATAGTA AAGGAAATGA 
TTTTACCTAT TACTTTTTAT 
ATATTTGCCA ATCACAKGTA 
A 



50 
100 
150 
200 
250 
300 
350 
381 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201 
AACGTAGGTG TCCTTCTTC 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202 
GTGTTGAAAT GTTCCGTAAA CA 



2 ) INFORMATION FOR SEQ ID NO : 12 03 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 03 
GGIGARMGIG GIAAYGARAT G 



2) INFORMATION FOR SEQ ID NO: 1204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 04 
GCIAAYAACI TCIWMYATGC C 



2 ) INFORMATION FOR SEQ ID NO: 1205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

655 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 PCT/CAOO/01 150 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205 
AAYACITCIA WYATGCCIGT 



2) INFORMATION FOR SEQ ID NO: 1206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206 
CKISRIGTIG ARTCIGCCA 



2) INFORMATION FOR SEQ ID NO: 1207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207 
CCITCITCWC CIGGCATYTC 



2) INFORMATION FOR SEQ ID NO: 1208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 08 
TCAAAAAGTT TTCTAAAAAA TTTAC 



2) INFORMATION FOR SEQ ID NO: 12 09 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209 
ACGGGCGTCC ACAT^AATCAA TAGGA 



2) INFORMATION FOR SEQ ID NO: 1210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210 
ACCAGCTTGC CCAATACAAA GG 



2) INFORMATION FOR SEQ ID NO: 1211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211 
ATTCTTGTAA CAGGCTTTGA TCCC 



2) INFORMATION FOR SEQ ID NO: 1212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212 
CCICCIRGIG GIGAIACIGC WCC 
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2) INFORMATION FOR SEQ ID NO: 1213 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213 

15 

AARGGIGGIA CIGCIGCIAT HCCIGG 2 6 



20 2) INFORMATION FOR SEQ ID NO: 1214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214 

GGTAT^AACAG GTACCTCTAA CTA 23 



35 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 1215 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1337 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: D471 

(C) ACCESSION NUMBER: X65717 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO : 1215 



AACAAAATAA AAGAACTTAC CTATTTTCCA TCCAAAATGT TTAGCAATCA 50 

TCATCTGCAA GGCAACGTAT TGCATGGCAT TGATGTGATG AGCAACTAAT 100 

55 ATGTCATTAG AACGTTGCGT CAAACTAGCA TCTAAATAAA GATCGAAATG 150 

CAGTTATCAA AAATGCAAGC TCCTATCGGC CCTTGTTTTA ATTATTACTC 200 

ACATTGCCTT AATGTATTTA CTTGCTTATT ATTAACTTTT TTGCTAAGTT 250 

AGTAGCGTCA GTTATTCATT GAAAGGACAT TATTATGAAA ATTCTTGTAA 3 00 

CAGGCTTTGA TCCCTTTGGC GGCGAAGCTA TTAATCCTGC CCTTGAAGCT 3 50 

60 ATCAAGAAAT TGCCAGCAAC CATTCATGGA GCAGAAATCA AATGTATTGA 4 00 
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AGTTCCAACG GTTTTTCAAA AATCTGCCGA TGTGCTCCAG CAGCATATCG 4 50 

AAAGCTTTCA ACCTGATGCA GTCCTTTGTA TTGGGCAAGC TGGTGGCCGG 500 

ACTGGACTAA CGCCAGAACG CGTTGCCATT AATCAAGACG ATGCTCGCAT 550 

TCCTGATAAC GAAGGGAATC AGCCTATTGA TACACCTATT CGTGCAGATG 60 0 

5 GTAAAGCAGC TTATTTTTCA ACCTTGCCAA TCAAAGCGAT GGTTGCTGCC 650 

ATTCATCAGG CTGGGCTTCC TGCTTCTGTT TCTAATACAG CTGGTACCTT 700 

TGTTTGCAAT CATTTGATGT ATCAAGCCCT TTACTTAGTG GATAAATATT 750 

GTCCAAATGC CAAAGCTGGG TTTATGCATA TTCCCTTTAT GATGGAACAG 8 00 

GTTGTTGATA AACCTAATAC AGCTGCCATG AACCTCGATG ATATTACAAG 850 

10 AGGAATTGAG GCTGCTATTT TTGCCATTGT CGATTTCAAA GATCGTTCCG 900 

ATTTAAAACG TGTAGGGGGC GCTACTCACT GACTGTGACG CTACTAAACC 950 

TATTTTAAAA AAACAGAGAT ATGAACTAAC TCTGTTTTTT TTGTGCTT^U^. 1000 

. AATGAAAGAC CTAGGGAAAC TTTTCATCGG TCTTTCTCAA TTGTCATCTT 1050 

AATCTAATAC TACTTCTAAC ATCAGCGGGT ATAGTTTGCC AGTAATTAAG 1100 

15 AAACGTTGTT GATCTAAATG AGCAATCCCA TTCAAAACAT TAAGGTCAGG 1150 

GTAATGGGAC TTATCAAGAT TTAAGGCTTT TAACAAAGGA CTAATATCAT 1200 

AGGTGGCTAC CACCTTTCCA GAATCAGGTT GGAGTTTGAC AATAGTATTG 12 50 

GTTTGCCAAA TATTGGCATA GAGATAACCA TCTACATACT CTAATTCGTT 1300 

AAGCATTGAG ATAGGGACAC TTTCTATAGC AACTAGT 1337 

20 



2) INFORMATION FOR SEQ ID NO: 1216 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216 
35 GGTAAGACTG GTACATCAAA CTA 23 



40 



2) INFORMATION FOR SEQ ID NO: 1217 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217 

50 

CAAATGCCAT TTCAAGTAAC ACAAC 25 



55 2) INFORMATION FOR SEQ ID NO: 1218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

659 



SUBSTITUTE SHEET (RULE 26) 



LO 



20 



25 



WO 01/23604 PCT/CAOO/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218 
CAAACGCCAT TTCAAGTAAC ACAAC 25 

2) INFORMATION FOR SEQ ID NO: 1219 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219 
CAAATGCTAT TTCAAGTAAT ACAAC 25 

2) INFORMATION FOR SEQ ID NO: 1220 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220 
CAAACGCCAT TTCAAGTAAT ACGAC 25 

40 

2) INFORMATION FOR SEQ ID NO: 1221 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221 
55 GAYACICCIG GICAYGTIGA YTT 23 



60 



2) INFORMATION FOR SEQ ID NO: 1222 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222 
ATYGAYACIC CIGGICAYGT IGAYTT 26 



L5 2) INFORMATION FOR SEQ ID NO: 1223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223 

AYITCIARRT GIARYTCRCC CATICC 26 



2) INFORMATION FOR SEQ ID NO: 1224 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224 
CCIGYIHTIY TIGARCCIAT lATG 24 

2 ) INFORMATION FOR SEQ ID NO: 1225 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225 
TAICCRAACA TYTCISMIAR IGGIAC 26 
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10 



20 



2) INFORMATION FOR SEQ ID NO: 1226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226 
15 GTIRMRTAIC CRAACATYTC 2 0 



2) INFORMATION FOR SEQ ID NO: 1227 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1227 

30 

GTICCIYTIK CIGARATGTT YGGITA 26 



35 2) INFORMATION FOR SEQ ID NO: 1228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228 

GTICCIYTIK CIGARATGTT YGGITAYGC 2 9 



50 2) INFORMATION FOR SEQ ID NO: 1229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229 
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TCCATYTGIG CIGCICCIGT lATCAT 



26 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 1230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(D) ACCESSION NUMBER: X00415 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1230 



TGAACGCCTA 
CGCACGCTAC 
CCACTACTAC 
GGTGAAGTTC 
GGAACGTGGT 
GTATGGCTAA 
GGGCACGTTG 
TGGTGCGGTA 
AAACCGTATG 
GTTAACAAAA 
GATCAAAACC 
GTGCTGAAGA 
ATCAACTGGA 
CCCGGCAGAC 
AATCCGCAGC 
GAAGAACTGA 
GAACAACGAA 
GTGTTCAGGC 
GACGTACCTG 
TGAACGTCAC 
TCGCTACCGA 
GGTGTGGTTA 
TGAGCGTTTC 
TCAAAGAAGT 
GTAACCACTG 
ACGTATGGAA 
CCAAAGCTGA 
GAAGACCCGT 
CATCGCGGGT 
AGCGTGAATT 
CGTGAAACTA 
ACAGTCTGGT 
CGCTGGAGCC 
AAAGGTGGTG 
CCAGGAACAG 
TGGGTATTCG 
CTGGCGTTTA 
AGCGAAACCA 
CGGAAGAGAA 
ATGCTCAAAG 



ATU^GATAAAC 
CGTAACATCG 
CGAACGTATT 
ATGACGGCGC 
ATTACCATCA 
GCAGTATGAG 
ACTTCACAAT 
ATGGTTTACT 
GCGTCAGGCA 
TGGACCGCAT 
CGTCTGGGCG 
ACATTTCACC 
ACGACGCTGA 
ATGGTTGAAC 
TGAAGCTTCT 
CTGAAGCAGA 
ATCATCCTGG 
GATGCTGGAT 
CGATCAACGG 
GCAAGTGATG 
CCCGTTTGTT 
ACTCTGGTGA 
GGTCGTATCG 
TCGCGCGGGC 
GTGACACCCT 
TTCCCTGAGC 
CCAGGAAAAA 
CTTTCCGTGT 
ATGGGCGAAC 
CAACGTTGAA 
TCCGCCAGAA 
GGTCGTGGTC 
GGGTTCT^AAC 
TAATCCCTGG 
CTGAAAGCAG 
TCTGCACTTC 
AACTGGCTGC 
GTTCTGCTTG 
CACCGGTGAC 
GTCAGGAATC 



GAGGAAACAA 
GTATCAGTGC 
CTGTTCTACA 
TGCAACCATG 
CTTCCGCTGC 
CCGCATCGCA 
CGAAGTAGAA 
GCGCAGTTGG 
AACAAATATA 
GGGTGCGAAC 
CGAACCCGGT 
GGTGTTGTTG 
CCAGGGCGTA 
TGGCTAACGA 
GAAGAGCTGA 
AATCAAAGGT 
TAACCTGTGG 
GCGGTAATTG 
TATCCTGGAC 
ACGAGCCGTT 
GGTAACCTGA 
TACCGTACTG 
TTCAGATGCA 
GACATCGCTG 
GTGTGACCCG 
CGGTAATCTC 
ATGGGTCTGG 
ATGGACTGAC 
TGCACCTCGA 
GCGAACGTAG 
AGTTACCGAT 
AGTATGGTCA 
CCGAAAGGCT 
CGAATACATC 
GTCCGCTGGC 
GGTTCTTACC 
TTCTATCGCC 
AGCCGATCAT 
GTTATCGGTG 
TGAAGTTACT 



ATGGCTCGTA 
GCACATCGAC 
CCGGTGTAAA 
GACTGGATGG 
GACTACTGCA 
TCAACATCAT 
CGTTCCATGC 
TGGTGTTCAG 
AAGTTCCGCG 
TTCCTGAAAG 
TCCGCTGCAG 
ACCTGGTGAA 
ACCTTCGAAT 
ATGGCACCAG 
TGGAAAAATA 
GCTCTGCGTC 
TTCTGCGTTC 
ATTACCTGCC 
GACGGTAAAG 
CTCTGCACTG 
CCTTCTTCCG 
AACTCCGTGA 
CGCTAACAAA 
CTGCTATCGG 
GATGCGCCGA 
CATCGCAGTT 
CTCTGGGCCG 
GAAGAATCTA 
CATCATCGTT 
GTAAACCGCA 
GTTGAAGGTA 
TGTTGTTATC 
ACGAGTTCAT 
CCGGCCGTTG 
AGGCTACCCG 
ATGACGTTGA 
TTTAAAGAAG 
GAAGGTTGAA 
ACTTGAGCCG 
GGCGTTAAGA 



CAACACCCAT 50 

GCCGGTAAAA 10 0 

CCATAAAATC 150 

AGCAGGAGCA 200 

TTCTGGTCTG 2 50 

CGACACCCCG 3 00 

GTGTTCTCGA 350 

CCGCAGTCTG 4 00 

CATTGCGTTC 450 

TTGTTAACCA 500 

CTGGCGATTG 550 

AATGAAAGCT 600 

ACGAAGATAT 650 

AACCTGATCG 70 0 

CCTGGGTGGT 750 

AGCGCGTTCT 800 

AAGAACAAAG 850 

ATCCCCGGTT 900 

ACACTCCGGC 950 

GCGTTCAAAA 1000 

TGTTTACTCC 1050 

AAGCTGCACG 1100 

CGTGAAGAGA 1150 

TCTGAAAGAC 1200 

TCATTCTGGA 1250 

GAACCGAAAA 13 00 

TCTGGCTAAA 1350 

ACCAGACCAT 1400 

GACCGTATGA 14 50 

GGTTGCTTAC 15 00 

AACACGCGAA 1550 

GACATGTACC 1600 

CAACGACATT 1650 

ATAAAGGTAT 1700 

GTAGTAGACA 1750 

CTCCTCTGAA 1800 

GCTTTAAGAA 1850 

GTAGAAACTC 1900 

TCGTCGTGGT 1950 

TCCACGCTGA 2000 
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AGTACCGCTG TCTGAAATGT TCGGATACGC AACTCAGCTG CGTTCTCTGA 2 050 

CCAAAGGTCG TGCATCATAC ACTATGGAAT TCCTGAAGTA TGATGAAGCG 2100 

CCGAGTAACG TTGCTCAGGC CGTT^TTGAA GCCCGTGGTA AATAA 2145 



2) INFORMATION FOR SEQ ID NO: 1231 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231 
GCGAGCCCGA AGATAAAAAA GAACCTCTGC TGCTCGC 3 7 

20 

2) INFORMATION FOR SEQ ID NO: 1232 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232 
35 GGAGCCGCGC GATTTTATAA ATGAATGTTG ATAACCGGCT CC 42 



40 



2 ) INFORMATION FOR SEQ ID NO: 1233 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233 

50 

GCGAGCGTTA CTGGTGTAGA AATGTTCCGG CTCGC 35 



55 2) INFORMATION FOR SEQ ID NO: 1234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234 
ACTAAATAAA CGCTCATTCG 20 

2) INFORMATION FOR SEQ ID NO: 1235 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235 
GCGAGCCGAA GTTGAAGTTG TTGGTATTGC TGGCTCGC 38 

2) INFORMATION FOR SEQ ID NO: 1236 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1236 
GCGAGCCGTG GTGAAGTTCG CGTTGGTGGC TCGC 34 

40 

2) INFORMATION FOR SEQ ID NO: 1237 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237 
55 GCGAGCCGCG AAATCGAAGT TGCTGTATTA GGGCTCGC 38 



60 



2) INFORMATION FOR SEQ ID NO: 1238 

665 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238 
GCGAGCG6CG TTAATTTTGG CACCGAAGAA GAGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 9 
GCGAGCGCAG ACCTTTCAGC AGAGGAGGCT CGC 



2 ) INFORMATION FOR SEQ ID NO: 1240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1240 
GCGAGCCGGC AAGACAATAT GACAGCAAAA TCGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241 
GCGAGCGGGG AACGAGGATG ATTTGATTGG CTCGC 
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2) INFORMATION FOR SEQ ID NO: 1242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EntGrococcus faecium 
15 (E) STRAIN: BM4147-1 

(F) ACCESSION NUMBER: U3 9790 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1242 

20 TTCTTAGAGA CATTGAATAT GCCTTATGTC GGCGCAGGCG TATTGACCAG 50 

TGCATGTGCC ATGGATAAAA TCATGACCAA GTATATTTTA CAAGCTGCTG 10 0 

GTGTGCCGCA AGTTCCTTAT GTACCAGTAC TTAAGAATCA ATGGAAAGAA 150 

AATCCTAAAA AAGTATTTGA TCAATGTGAA GGTTCTTTGC TTTATCCGAT 200 

GTTTGTCAAA CCTGCGAATA TGGGTTCTAG TGTCGGCATT ACAAAGGCAG 250 

25 AAAACCGAGA AGAGCTGCAA AATGCTTTAG CAACAGCCTA TCAGTATGAT 3 00 

TCTCGAGCAA TCGTTGAACA AGGAATTGAA GCGCGCGAAA TCGAAGTTGC 350 

TGTATTAGGA AATGAAGATG TTCGGACGAC TTTGCCTGGC GAAGTCGTAA 4 00 

AAGACGTAGC ATTCTATGAT TATGAAGCCA AATATATCAA TAATAAAATC 4 50 

GAAATGCAGA TTCCAGCCGA AGTGCCGGAA GAAGTTTATC AAAAAGCGCA 5 00 

30 AGAGTACGCG AAGTTAGCTT ACACGATGTT AGGTGGAAGC GGATTGAGCC 5 50 

GGTGCGATTT CTTTTTGACA AATAAAAATG AATTATTCCT GAATGAATTA 600 



35 2) INFORMATION FOR SEQ ID NO: 1243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 75 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecetlis 

(C) ACCESSION NUMBER: M38386 



50 



(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1243 



GGTACCAAAG AAAAAAACGA ACGCCACAAC CAACAGCCTC TAAAGCAACA 50 

CCTGCTTCTG AAATTGAGGG AGATTTAGCA AATGTCAATG AGATTCTTTT 100 

GGTTCACGAT GATCGTGTCG GGTCAGCAAC GATGGGAATG AAAGTCTTAG 150 

AAGAAATTTT AGATAAAGAG AAAATTTCAA TGCCGATTCG AAAAATTAAT 2 00 

55 ATTAATGAAT TAACTCAACA AACACAGGCT TTAATTGTCA CAAAAGCTGA 250 

ACTAACGGAA CAAGCACGTA AAAAAGCACC GAAAGCGACA CACTTATCAG 300 

TAAAAAGTTA TGGTTAATCC CCAAAAATAT GAAACAGTGG GTTTCGCTCT 3 50 

TAAAAGAAAG TGCCTAGAGA GGAAGAAAAC AATGGAAAAT CTTACGAATA 4 00 

TTTCAATTGA ATTAAATCAA CAGTTTi\ATA CAAAAGAAGA AGCTATTCGC 450 

60 TTTTCCGGCC AGAAACTAGT CGAGGCAGGC TGTGTTGAGC CCGCTTATAT 500 
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CGAAGCAATG ATTGA7VAGAG ACCAATTGCT ATCTGCCCAT ATGGGGAATT 550 

TTATTGCCAT TCCTCATGGA ACAGAAGAAG CCT^AAAAATT AGTGAAAAAA 600 

TCAGGAATCT GTGTAGTGCA AGTCCCAGAG GGCGTTAATT TTGGCACCGA 650 

AGAAGATGAA AAAATTGCTA CCGTATTATT TGGGATTGCC GGAGTCGGTG 700 

5 AAGAACATTT GCAATTAGTC CAACAAATTG CACTTTATTG TAGTGATATG 750 

GATAACGTGG TGCAACTTGC CGATGCATTA AGTAAAGAAG AAATAACAGA 800 

AAATTTAGCC ATTGCTTAAA GGAGAGAATA AGAATGAACG CAGTACATTT 850 

TGGAGCAGGA AATATTGGAC GCGGCTTTAT TGGCGAAATT TTAGCTAAAA 900 

CGGGTTTCAT ATTACCGTTT GTGGATGTTA ATGGAAACCA TCATCAAGCG 950 

10 TTAAAAGAAC GTAAAAGTTA TACAATTGAA TTGGCCGATG CCTCACATCA 1000 

ACAAATTAAC GTTGAAAATG TGACCGGGTT A7U\TAACATG ACAGAACCAG 1050 

AAAAAGTAGT AGAAGCAATT GCGGAAGCCG ATTTAGTCAC GACGGCAATT 1100 

GGTCCTAATA TTTTACCAAG AATTGCTGAA TTAATTGCTC AAGGAATTGA 1150 

TGCACGTGCC GAAGCAAATT GTCAAAACGG CCCGCTGGAT ATTATCGCTT 12 00 

15 GTGAAAATAT GATTGGTGGT TCAACCTTTT TAGCAGAAGA AGTGGCCATA 1250 

ATATTTGAAA AACCCAGCTT ATCTGAACAA TGGATTGGTT TTCCTGATGC 1300 

GGCAGTTGAT CGGATTGTTC CATTACAAAA ACATAAAGAT CCACTTTTTG 13 50 

TTCAAGTTGA GCCTTTTTGT GAATGGGTCA TTGATGATAC CAACCGAAAA 1400 

GCCAAAGAGA TTCAGTTAGA AGGCGTCATT ACTTGTCGAT TAGAGCCGTA 1450 

20 TATTGAACGA AAATTATTTA GTGTAACCAG TGGCCATGCT ACAGTTGCCT 1500 

ATACAGGGGC GTTGTTAGGC TATCAAACCA TTGACGAAGC GATGCAGGAC 1550 

GCCTTAGTGG TAGCGCAACT CAAATCAGTT TTGCAGGAAA CCGGTAAACT 1600 

TTTAGTGGCC AAATGGAATT TTGATGAACA AGAACATGCA GCCTATATTG 1650 

AAAAAATTAT CAACCGTTTC CAAAATAAAT ATATTTCAGA TGCTATTACA 1700 

25 CGTGTAGCAC GGACACCAAT CAGAAAATTA GGTGCGCAAG AACGGTTTAT 175 0 

TCGACCAATC CGTGAATTAC AGGAACGCAA TCTAGTGTCG GCCGCATTTA 1800 

TAGC7ATGAT TGGTATTGTC TTTAATTATC ATGATCCAGA AGATGAACAA 1850 

AGCCGTCAAT TACAGGAAAT GCTTGACCAA GAAAGTGTTG ATACAGTGGA 1900 

TCGCTGAAGT AACGGGCATT GAAGATCCAG AT^CGGTTAA AAATATTAAA 1950 

30 CAAAACGTAG AACTGCTATG CGCGACCACA AGTAGCATAA TTAACAAAAT 2 000 

CCTTCTACCA AGATACTTCA CATTTCTTAA TTAAAGAAAA AACAACCGCG 2050 

CCTCACCTGA GCCGACCCCC AAAAGTTAGA CCTAGAAATC TAACTTTTGG 2100 

AGGTTTTTTT GTATGGCAAA ATACAGTTTT GAAATTTAAA CTTAAACTTG 215 0 

TTCATGACTA CTTATATGGT CAAGGAGGTC TAAGGTTTCT CGCAAAGAAG 22 0 0 

35 TATGGGTTTA AAGATAGTCT CAAATAAGCA AATGGATAAA TGCCTATAAA 22 50 

GAACTTGGTG AAGAAGGGGG GATCC 2275 



40 2) INFORMATION FOR SEQ ID NO: 1244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp, aureus 

(B) STRAIN: ATCC 25923 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244 



GATCAATCTT TGTCGGTACA CGATATTCTT CACGACTAAA TAAACGCTCA 50 

TTCGCGATTT TATAAATGAA TGTTGATAAC AATGTTGTAT TATCTACTGA 100 

AATCTCATTA CGTTGCATCG GAAACATTGT GTTCTGTATG TAAAAGCCGT 150 

CTTGATAATC TTTAGTAGTA CCGT^GCTGG TCATACGAGA GTTATATTTT 2 00 

60 CCAGCCAAAA CGATATTTTT ATAATCATTA CGTGAAAAAG GTTTCCCTTC 250 
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ATTATCACAC AAATATTTTA GCTTTTCAGT 
CTTTATCCAT ACGTTGAATA ATTGTACGAT 
ACACCTTTAA TGTTATTTGT TTTAAAAGCA 
ACGATGTGAA TCTTCTAAGA AGTCACCGTA 
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TTCTATATCA ACTGTAGCTT 300 

TCTGACGCAC CATCTTTTGC 350 

TGAATAAGTT TTTCAACACA 400 

AAATGAAGGA TC 442 
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2) INFORMATION FOR SEQ ID NO: 1245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

2 0 GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC TU^TTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

25 TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 4 50 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

3 0 GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 
ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 
ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTC 845 

35 

2) INFORMATION FOR SEQ ID NO: 1246 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 11986 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246 

TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA ATGTCTTCCA 50 
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CAGATGGACT TGTTCGTGGC ACAGTU^GTAG AAGATACTGG TAAAGCAATC 100 

TCTGTACCAG TTGGTGATGT AACACTTGGT CGTGTATTTA ACGTATTAGG 150 

TGATGCAATT GACTTAGATG GTGATGTTCC TGCGGATGTA CGTCGTGATC 200 

CAATTCACCG TCAAGCGCCT GCATTCGAAG AGTTATCTAC TAAAGTAGAA 250 

5 ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC CTTACATTAA 300 

GGGTGGTAAG ATTGGTCTAT TCGGTGGTGC CGGCGTAGGT AT^AACAGTAT 350 

TAATTCAGGA ATTAATTAAT AACATCGCAC AAGAGCACGG TGGTATCTCT 4 00 

GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAAGGTAACG ACTTATACCA 4 50 

CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG GTATTCGGAC 500 

10 AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCATT AACAGGTTTA 550 

ACAATGGCTG AACATTTCCG TGATGAGCAA GGACAAGACG TACTATTGTT 600 

CATCGATAAC ATCTTCCGTT TCACGCAAGC GGGTTCTGAA GTATCTGCCC 650 

TTCTTG 656 



15 



30 



50 



2) INFORMATION FOR SEQ ID NO: 1247 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247 



CGAAAACGGA AGTATTAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

35 GGCACAGAAG TAGAAGATAC TGGTAAACCA ATCTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGTCGCGTAT TTAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAGGT TCCTGCAGAT GTACATCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

40 TATTCGGTGG TGCCGGCGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 4 50 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGA7\ATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

45 CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT A 791 



2) INFORMATION FOR SEQ ID NO: 1248 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bacillus thurlngiensls 

(B) STRAIN: BGSC 4AC1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1248 

15 ATCTACAATG CCCTTACGGT AAAACAAAGC AACGAAi\ACG GAAGCATGAA 50 

CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 

TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 

ACTGGTAAAG CAATCTCTGT ACCAGTTGGT GATGCAACAC TTGGACGTGT 200 

ATTCAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAA CTTCCTGCGG 250 

2 0 ATGTACACCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 300 

TCTACTAAAG TAGAAATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 350 

TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 4 00 

TAGGTAAAAC AGTATTAATT CAGGAGTTAA TCAATAACAT CGCACAAGAG 4 50 

CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 500 

25 TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 550 

CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 600 

GCATTAACAG GTTTAACTU^T GGCTGAGCAT TTCCGTGATG AGCAAGGACA 650 

AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 700 

CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 7 50 

3 0 CCAACACTTG CAACAGAAAT GGGTCAATTA CAAGAGCGTA TTACATCTAC 800 

AAAT7\AAGGG TCTATCACGT CTATC * 825 



35 2) INFORMATION FOR SEQ ID NO: 1249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thurlngiensls 

(B) STRAIN: HER 1236 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGTATTAA 50 
CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 
TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGT^GAT 150 
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ACTGGTAAAC CAATCTCTGT ACCAGTTGGT GATGTAACAC TTGGTCGCGT 

ATTTAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAG GTTCCTGCAG 

ATGTACATCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 

TCTACTAAAG TAG7\AATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 

TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 

TAGGTAAAAC AGTATTAATT CAGGAATTAA TTAACAACAT CGCACAAGAG 

CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 

TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 

CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 

GCATTAACAG GTTTAACAAT GGCTGAGCAT TTCCGTGATG AGCAAGGACA 

AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 

CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 

CCAACACTTG CAACAGAAAT GGGTC 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
775 



2) INFORMATION FOR SEQ ID NO: 1250 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250 

CCAGCAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAGC 50 

GAGCATCAAC TTAACATTTG AAGTTGCACT TCATTTAGGT GATGACACAG 100 

TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG TGGCACAGAA 150 

GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG ATGTAACACT 200 

TGGTCGCGTA TTCAACGTAT TAGGTGATGC AATTGACTTA GATGGTGATG 250 

TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC ACCTGCATTC 300 

GAAGAACTAT CTACAAGAAT AGAAATTCTT GAAACTGGTA TTAAAGTAGT 350 

AGATTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACG GTATTAATTC AGGAATTAAT TAACAACATC 450 

GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG GTGAGCGTAC 500 

TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT GGCGTAATTA 550 

AGAAAACTGC GATGGTATTT GGACAAATGA ACGAGCCACC TGGAGCACGT 600 

CAACGTGTTG CATTAACAGG TTTAACAATG GCTGAACATT TCCGTGATGA 650 

GCAAGGACAA GACGTACTAT TGTTCATCGA TAACATCTTC CGTTTCACGC 700 

AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GTCGTATGCC ATCTGCGGTA 750 

GGTTACC7VAC CAACACTTGC AACAGAAATG GGTCAATTAC AAGAGCGTAT 800 

TACATCTACA AATAAAGGGT CTATCACGTC TA 832 



2) INFORMATION FOR SEQ ID NO: 1251 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thurxngiensis 

(B) STRAIN: HER 1418 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1251 



AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 

AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 

AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA GAAGTAGAAG 150 

ATACTGGTAA ACCAATCTCT GTACCAGTTG GTGATGTAAC ACTTGGTCGC 200 

2 0 GTATTTT^ACG TATTAGGTGA TGCAATTGAC TTAGATGGTG AGGTTCCTGC 250 

AGATGTACAT CGTGATCCAA TTCACCGTCA AGCACCTGCA TTCGAAGAAT 300 

TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT AGTAGACTTA 350 

CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG GTGGTGCCGG 400 

CGTAGGTAAA ACAGTATTAA TTCAGGAATT AATTAACAAC ATCGCACAAG 450 

25 AGCACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG TACTCGTGAG 500 

GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA TCT^GAAAAC 550 

TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA CGTCAACGTG 600 

TTGCATTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA TGAGCAAGGA 650 

CAAGACGTAC TTCTGTTCAT CGATAACATC TTCCGTTTCA CGCAAGCGGG 700 

30 TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT GCCATCTGCG GTAGGTTACC 750 

AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG TATTACATCT 800 

AC 802 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1252 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1410 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252 

AAATCTACAA CGCCCTTACG GT7y\AACAAA GCAACGAAAA CGGAAGTATT 50 
AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 
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AGTTGCAATG TCTTCCACAG ATGGACTTGT 
ATACTGGTAA ACCAATCTCT GTACCAGTTG 
GTATTTAACG TATTAGGTGA TGCAATTGAC 
AGATGTACAT CGTGATCCAA TTCACCGTCA 
5 TATCTACTAA AGTAGAAATT CTTGAAACTG 
CTTGCTCCTT ACATTAAGGG TGGTAAGATC 
CGTAGGTAAA ACAGTATTAA TTCAGGAATT 
AGCACGGTGG TATCTCTGTA TTCGCTGGTG 
GGTAATGACT TATACCACGA AATGAGCGAT 

10 TGCGATGGTA TTCGGACAAA TGAACGAGCC 
TTGCATTAAC AGGCTT7U\CA ATGGCTGAGC 
CAAGACGTAC TTCTGTTCAT CGATAACATC 
TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT 
AACCAACACT TGCi\ACAGAA ATGGGTCAAT 

15 ACAAATAAAG GGTCTATCAC GTC 



TCGTGGCACA GAAGTAGAAG 150 

GTGATGTAAC ACTTGGTCGC 200 

TTAGATGGTG AGGTTCCTGC 2 50 

AGCACCTGCA TTCGAAGAAT 300 

GTATTAAAGT AGTAGACTTA 350 

GGCCTATTCG GTGGTGCCGG 4 00 

AATTAACAAC ATCGCACAAG 4 50 

TAGGTGAGCG TACTCGTGAG 500 

TCTGGCGTAA TCAAGAAAAC 550 

ACCTGGAGCA CGTCAACGTG 600 

ATTTCCGTGA TGAGCAAGGA 650 

TTCCGTTTCA CGCAAGCGGG 700 

GCCATCTGCG GTAGGTTACC 7 50 

TACAAGAGCG TATTACATCT 800 

823 



2) INFORMATION FOR SEQ ID NO: 1253 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1253 

35 CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

40 CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATCAA TAACATCGCA 450 

CAAGAGCATG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

45 TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 7 00 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

50 TACCAACCGA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATT 7 98 
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2) INFORMATION FOR SEQ ID NO: 1254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1254 

CATTTGAAGT TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA 50 

ATGTCTTCCA CAGATGGACT TGTTCGTGGC ACAGAAGTAG AAGATACTGG 100 

TAAAGCAATC TCTGTACCAG TTGGTGATGC AACACTTGGT CGTGTATTTA 150 

ACGTATTAGG TGATGCAATT GACTTAGATG GTGAGGTTCC TGCGGATGTA 200 

CGTCGTGATC CAATTCACCG TCAAGCACCT GCATTCGAAG AATTATCTAC 250 

TAAAGTAGAA ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC 300 

CTTACATTAA GGGTGGTAAG ATCGGTCTAT TCGGTGGTGC CGGTGTAGGT 350 

AAAACGGTAT TAATTCAGGA ATTAATCAAT AACATCGCAC AAGAACACGG 4 00 

TGGTATCTCT GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAGGGTAATG 4 50 

ACTTATACCA CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG 500 

GTATTCGGAC AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCGTT 550 

AACAGGTTTA ACAATGGCTG AGCATTTCCG TGATGAGCAA GGACAAGACG 600 

TACTTCTGTT CATCGATAAT ATCTTCCGTT TCACGCAAGC AGGTTCTGAA 650 

GTATCTGCCC TTCTTGGCCG TATGCCATCT GCGGTAGGTT ACCAACCAAC 700 

ACTTGCAACA GAAATGGGTC AATTACAAGA GCGTATTACA TCTACAAATA 750 

AAGGGTCTAT CACGTCT 7 67 



2) INFORMATION FOR SEQ ID NO: 1255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: C-14 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255 

GAAATGCGTG AATCATTTTT AGATTATGCG ATGAGTGTTA TCGTTGCTCG 50 
TGCATTGCCA GATGTTCGTG ACGGTTTAAA ACCAGTACAT CGTCGTATAC 100 
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TATATGGATT AAATGT^CAA GGTATGACAC CGGATAAATC ATATAAAAAA 150 

TCAGCACGTA TCGTTGGTGA CGTAATGGGT AAATATCACC CTCATGGTGA 200 

CTTATCTATT TATGAAGCAA TGGTACGTAT GGCTCAAGAT TTCAGTTATC 250 

GTTATCCGCT TGTTGATGGC CAAGGTAACT TTGGTTCAAT GGATGGAGAT 300 

5 GGCGCAGCAG CAATGCGTTA TACTGAAGCG CGTATGACTA AAATCACACT 350 

TGAACTGTTA CGTGATATTA ATAAAGATAC AATAGATTTT ATCGATAACT 4 00 

ATGATGGTAA TGAAAGAGAG CCGTCAGTCT TACCTGCTCG ATTCCCTAAC 4 50 

TTATTAGCCA ATGGTGCATC AGGTATCGCG GTAGGTATGG CAACGAATAT 500 

TCCACCACAT AACTTAACAG AATTAATCAA TGGTGTACTT AGCTTAAGTA 550 

10 AGAACCCTGA TATTTCAATT GCTGAGTTAA TGGAGGATAT TGAAGGTCCT 600 

GATTTCCCAA CTGCTGGACT TATTTTAGGT AAGAGTGGTA TTAGACGTGC 650 

ATATGAAACA GGTCGTGGTT CAATTCAT^T GCGTTCTCGT GCAGTTATTG 7 00 

AAGAACGTGG AGNCGGACGT CAACGTATTG TTGTCACTGA AATTCCTTTC 750 

CAAGTGAATA AGGCTCGTAT GATTGAT^AAA ATTGCAGAGC TCGTTGGTGA 800 

15 CAAGAAAATT GACGGTATCA CTGATTTACG TGATGAAACA AGTTTACGTA 850 

CTGGTGTGCG TGTCGTTATT GATGTGCGTA AGGATGChAA TGCTAGTGTC 900 

ATTTTAAATA ACTTATACAA ACAAACACCT CTTCAAACAT CATTTGGTGT 950 

GAATATGATT GCACTTGTAA ATGGTAGACC GAAGCTTATT AATTTAAAAG 1000 

AAGCGTTGGT ACATTATTTA GAGCATCAAA AGACAGTTGT TAGAAGACGT 1050 

2 0 ACGCAATACA ACTTACGTAA AGCTAAAGAT CGTGCCCACA TTTTAGAAGG 1100 

ATTACGTATC GCACTTGACC ATATCGATGA AATTATTTCA ACGATTCGTG 1150 

AGTCAGATAC AGATAAAGTT GCAA 1174 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1256 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10209 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCT^AT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

45 GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGTAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGA GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCGCCTGC ATTCGAAGAG 300 

TTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 4 00 

50 GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCACG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGAGC ACGTCAACGT 600 
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GTTGCATTAA CAGGTTTAAC 
GCAAGACGTA CTATTGTTCA 
GTTCTGAAGT ATCTGCCCTT 
CAGCCAACAC TTGCAACAGA 



AATGGCTGAA CATTTCCGTG 
TCGATAACAT CTTCCGTTTC 
CTTGGTCGTA TGCCATCTGC 
AATGGGTCAA 



ATGAGCAAGG 650 
ACGCAAGCGG 700 
GGTAGGTTAC 750 

780 



2) INFORMATION FOR SEQ ID NO: 1257 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 
20 (B) STRAIN: CIP 7700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257 

CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA ACGAAAACGG 50 

25 AACAAGCATT AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGACA 100 

CAGTTCGTAC AGTTGCAATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

ACTTGGTCGT GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AGGTTCCTGC GGATGTACGT CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

30 TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGTCTATTCG 400 

GTGGTGCCGG TGTAGGTAAA ACGGTATTAA TTCAGGAATT AATCAATAAC 450 

ATCGCACAAG AACACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG 500 

TACTCGTGAG GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA 550 

35 TTAAGAAAAC TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA 600 

CGTCAACGTG TTGCGTTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA 650 

TGAGCAAGGA CAAGATGTAC TTCTGTTCAT CGATAATATC TTCCGTTTCA 700 

CGCAAGCAGG TTCTGAAGTA TCTGCCCTTC TTGGCCGTAT GCCATCTGCG 750 

GTAGGTTACC AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG 800 

40 TATTACATCT ACAAATA 817 



2) INFORMATION FOR SEQ ID NO: 1258 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



678 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1404 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258 

CCAGT^TCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

TATTAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAACCAAT CTCTGTACCA GTTGGTGATG TAACACTTGG 200 

TCGCGTATTT AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAGGTTC 250 

CTGCAGATGT ACATCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATTAA CAACATCGCA 4 50 

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT AAAGGGTCTA TCACGTCTA 829 



2) INFORMATION FOR SEQ ID NO: 1259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1259 

TGGCGGAAAG CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA 50 

ACGAAAACGG TGAACTTAAC TTAACATTTG AAGTTGCACT TCATTTAGGT 100 

GATGATACAG TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG 150 

TGGCACAGAA GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG 200 

ATGCAACACT TGGTCGCGTA TTTAACGTAT TAGGTGATGC TATTGACTTA 2 50 

GATGGTGAGG TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC 300 

ACCTGCATTC GAAGAATTAT CTACTAAAGT AGAAATTCTT GAAACTGGTA 350 

TTAAAGTAGT AGACTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGC 400 

CTATTCGGTG GTGCCGGTGT AGGTAAAACA GTATTAATTC AGGAGTTAAT 4 50 

CAACAACATC GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG 500 
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GTGAGCGTAC TCGTGAGGGT 

GGCGTAATTA AGAAAACTGC 

TGGAGCACGT CAACGTGTTG 

TCCGTGATGA GCAAGGACAA 

CGTTTCACGC AAGCAGGTTC 

ATCTGCGGTA GGTTACCAAC 

AAGAGCGTAT TACATCTACA 



AATGACTTAT ACCACGi\AAT GAGCGATTCT 
GATGGTATTC GGACAAATGA ACGAGCCACC 
CATTAACAGG CTTAACAATG GCTGAATATT 
GACGTACTTC TGTTCATCGA TAATATCTTC 
TGAAGTATCT GCCCTTCTTG GCCGTATGCC 
CAACACTTGC AACAGAAATG GGTCAATTAC 
T^TAAAGGGT CTATCACGTC TATC 



550 
600 
650 
700 
750 
800 
844 



2) INFORMATION FOR SEQ ID NO: 1260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1260 

AAGCTACCAG AAATCTACAA CGCCCTTACG GTAAAACAGA GCAACGAAAA 50 

CGGAACAAGC ATTAACTTAA CATTTGAAGT TGCACTTCAT TTAGGTGATG 100 

ACACAGTTCG TACAGTTGCA ATGTCTTCCA CAGATGGACT TGTTCGTGGC 150 

ACAGAAGTAG AAGATACTGG TAAAGCAATC TCTGTACCAG TTGGTGATGC 200 

AACACTTGGT CGTGTATTTA ACGTATTAGG TGATGCAATT GACTTAGATG 250 

GTGAGGTTCC TGCGGATGTA CGCCGTGATC CAATTCACCG TCAAGCACCT 300 

GCATTCGAAG AATTATCTAC TAAAGTAGAA ATTCTTGAAA CTGGTATTAA 350 

AGTAGTAGAC TTACTTGCTC CTTACATTAA GGGTGGTAAG ATCGGTCTAT 400 

TCGGTGGTGC CGGTGTAGGT AAAACAGTAT TAATTCAGGA ATTAATCAAC 4 50 

AACATCGCAC AAGAACACGG TGGTATCTCT GTATTCGCTG GTGTAGGTGA 500 

GCGTACTCGT GAGGGTAATG ACTTATACCA CGAAATGAGC GATTCAGGCG 550 

TAATTAAGAA AACTGCGATG GTATTCGGAC AAATGAACGA GCCACCTGGA 600 

GCGCGTCAAC GTGTTGCGTT AACAGGTTTA ACAATGGCTG AGCATTTCCG 650 

TGATGAGCAA GGACAAGACG TTCTTCTGTT CATCGATAAT ATCTTCCGTT 7 00 

TCACGCAAGC AGGTTCTGAA GTATCTGCCC TTCTTGGTCG TATGCCATCT 7 50 

GCGGTAGGTT ACCAACCAAC ACTTGCAACA GAAATGGGTC AATTACAAGA 8 00 

GCGTATTACA TCTACAAATA AAGGGTCTAT CACGTCTATC 840 



2) INFORMATION FOR SEQ ID NO: 1261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AZ1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261 



10 GCGGAAAGCT ACCAGAAATC TACAATGCCC TTACGGTAAA ACAAAGCAAC 50 

GAAAACGGAA GCATGAACTT AACATTTGAA GTTGCACTTC ATTTAGGTGA 100 

TGATACAGTT CGTACAGTTG CGATGTCTTC CACAGATGGA CTTGTTCGTG 150 

GCACAGAAGT AGAAGATACT GGTAAAGCAA TCTCTGTACC AGTTGGTGAT 200 

GCAACACTTG GACGTGTATT CAACGTATTA GGTGATGCAA TTGACTTAGA 250 

15 TGGTGAACTT CCTGCGGATG TACACCGTGA TCCAATTCAC CGTCAAGCAC 300 

CTGCATTCGA AGAATTATCT ACTAAAGTAG AAATTCTTGA AACTGGTATT 350 

AAAGTAGTAG ACTTACTTGC TCCTTACATT AAGGGTGGTA AGATCGGCCT 4 00 

ATTCGGTGGT GCCGGCGTAG GTAAAACAGT ATTAATTCAG GAGTTAATCA 4 50 

ATAACATCGC ACAAGAGCAC GGTGGTATCT CTGTATTCGC TGGTGTAGGT 500 

20 GAGCGTACTC GTGAGGGTAA TGACTTATAC CACGAAATGA GCGATTCTGG 550 

CGTAATCAAG AAAACTGCGA TGGTATTCGG ACAAATGAAC GAGCCACCTG 600 

GAGCACGTCA ACGTGTTGCA TTAACAGGTT TAACAATGGC TGAGCATTTC 650 

CGTGATGAGC AAGGACAAGA CGTACTTCTG TTCATCGATA ACATCTTCCG 700 

TTTCACGCAA GCGGGTTCTG AAGTATCTGC CCTTCTTGGT CGTATGCCAT 750 

25 CTGCGGTAGG TTACCAACCA ACACTTGCAA CAGAAATGGG TCAATTACAA 800 

GAGCGTATTA CATCTACAAA TAAAGGGTCT ATCACGTCT 839 



30 2) INFORMATION FOR SEQ ID NO: 1262 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4H2 



(xi) SEQUENCE DESCRIPTION: SEQ 

45 

AAGCTACCAG AAATCTACAA TGCCCTTACG 
CGGAAGCATG AACTTAACAT TTGAAGTTGC 
CAGTTCGTAC AGTTGCGATG TCTTCCACAG 
GAAGTAGAAG ATACTGGTAA AGCAATCTCT 
50 ACTTGGACGT GTATTCAACG TATTAGGTGA 
AACTTCCTGC GGATGTACAC CGTGATCCAA 
TTCGAAGAAT TATCTACTAA AGTAGAAATT 
AGTAGACTTA CTTGCTCCTT ACATTAAGGG 

68 



ID NO: 1262 

GTAAAACAAA GCAACGAAAA 50 

ACTTCATTTA GGTGATGATA 100 

ATGGACTTGT TCGTGGCACA 150 

GTACCAGTTG GTGATGCAAC 200 

TGCAATTGAC TTAGATGGTG 250 

TTCACCGTCA AGCACCTGCA 300 

CTTGAAACTG GTATTAAAGT 350 

TGGTT^GATC GGCCTATTCG 400 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

GTGGTGCCGG CGTAGGTAAA 

ATCGCACAAG AGCACGGTGG 

TACTCGTGAG GGTAATGACT 

TCAAGAAAAC TGCGATGGTA 

5 CGTCAACGTG TTGCATTAAC 

TGAGCAAGGA CAAGACGTAC 

CGCAAGCGGG TTCTGAAGTA 

GTAGGTTACC AACCAACACT 

TATTACATCT ACAAATAAAG 



ACAGTATTAA TTCAGGAATT 
TATCTCTGTA TTCGCTGGTG 
TATACCACGA AATGAGCGAT 
TTCGGACAAA TGAACGAGCC 
AGGTTTAACA ATGGCTGAGC 
TTCTGTTCAT CGATAACATC 
TCTGCCCTTC TTGGTCGTAT 
TGCAACAGAA ATGGGTCAAT 
GGTCTATCAC GTC 



PCT/CAOO/01150 

AATCAATAAC 450 

TAGGTGAGCG 500 

TCTGGCGTAA 550 

ACCTGGAGCA 600 

ATTTCCGTGA 650 

TTCCGTTTCA 700 

GCCATCTGCG 750 

TACAAGAGCG 800 

833 



2) INFORMATION FOR SEQ ID NO: 1263 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
25 (B) STRAIN: BGSC 4Q1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263 

CGAAAACGGA AGCATGAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

30 ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAAGCA ATTTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGACGTGTAT TCAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAACT TCCTGCGGAT GTACACCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

35 T/y^GTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGTGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 4 50 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

4 0 GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT 7 90 

45 



2) INFORMATION FOR SEQ ID NO: 1264 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

682 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1264 

10 

AGTTGCACTT CATTTAGGTG ATGATACAGT TCGTACAGTT GCGATGTCTT . 50 

CCACAGATGG ACTTGTTCGT GGCACAGAAG TAGAAGATAC TGGTAAACCA 100 

ATCTCTGTAC CAGTTGGTGA TGTAACACTT GGTCGCGTAT TTAACGTATT 150 

AGGTGATGCA ATTGACTTAG ATGGTGAGGT TCCTGCAGAT GTACATCGTG 200 

15 ATCCAATTCA CCGTCAAGCA CCTGCATTCG AAGAATTATC TACTAAAGTA 250 

GAAATTCTTG AAACTGGTAT TAAAGTAGTA GACTTACTTG CTCCTTACAT 300 

TAAGGGTGGT AAGATCGGCC TATTCGGTGG TGCCGGCGTA GGTAAAACAG 350 

TATTAATTCA GGAATTAATT AACAACATCG CACAAGAGCA CGGTGGTATC 400 

TCTGTATTCG CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ATGACTTATA 4 50 

20 CCACGAAATG AGCGATTCTG GCGTAATCAA GAAAACTGCG ATGGTATTCG 500 

GACAAATGAA CGAGCCACCT GGAGCACGTC AACGTGTTGC ATTAACAGGT 550 

TTAACAATGG CTGAGCATTT CCGTGATGAG CAAGGACAAG ACGTACTTCT 600 

GTTCATCGAT AACATCTTCC GTTTCACGCA AGCGGGTTCT GAAG 644 

25 



2) INFORMATION FOR SEQ ID NO: 1265 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1265 



GGCGGAAAGC TACCAGAAAT 
CGAAAACGGA ACAAGCATTA 

45 GTGATGACAC AGTTCGTACA 
CGTGGCACAG AAGTAGAAGA 
TGATGCT^CA CTTGGTCGTG 
TAGATGGTGA GGTTCCTGCG 
GCACCTGCAT TCGAAGAATT 

50 TATTA7\AGTA GTAGACTTAC 
GTCTATTCGG TGGTGCCGGT 
ATCAATAACA TCGCACAAGA 
AGGTGAGCGT ACTCGTGAGG 



CTACAACGCC CTTACGGTAA 
ACTTAACATT TGAAGTTGCA 
GTTGCAATGT CTTCCACAGA 
TACTGGTAAA GCAATCTCTG 
TATTTAACGT ATTAGGTGAT 
GATGTACGTC GTGATCCAAT 
ATCTACTAAA GTAGAAATTC 
TTGCTCCTTA CATTAAGGGT 
GTAGGTAAAA CGGTATTAAT 
ACACGGTGGT ATCTCTGTAT 
GTAATGACTT ATACCACGAA 

683 



AACAGAGCAA 50 

CTTCATTTAG 100 

TGGACTTGTT 150 

TACCAGTTGG 200 

GCAATTGACT 250 

TCACCGTCAA 300 

TTGAAACTGG 350 

GGTAAGATCG 400 

TCAGGAATTA 4 50 

TCGCTGGTGT 500 

ATGAGCGATT 550 
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CTGGCGTAAT TAAGAAAACT 
CCTGGAGCAC GTCAACGTGT 
TTTCCGTGAT GAGCAAGGAC 
TCCGTTTCAC GCAAGCAGGT 
5 CCATCTGCGG TAGGTTACCA 
ACAAGAGCGT ATTACATCTA 



GCGATGGTAT TCGGACAAAT 
TGCGTTAACA GGTTTAACAA 
AAGATGTACT TCTGTTCATC 
TCTGAAGTAT CTGCCCTTCT 
ACCAACACTT GCAACAGAAA 
CAA 



GAACGAGCCA 600 
TGGCTGAGCA 650 
GATAATATCT 700 
TGGCCGTATG 750 
TGGGTCAATT 800 

823 



10 2) INFORMATION FOR SEQ ID NO: 1266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266 



TGGTCCGAGR CCCGATTCMA TGAAATTATC AAGGAAACCT CCAAYTTCAT 50 

TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT CCCATTTCTG 100 

GTTTCCAGGG TGACAACATG ATCGATSCCT CTGCCAACTG CCCATGGTAC 150 

AAGGGCTGGT ACMAKGAGAC TGCCGACAGG CAAGYACTCT GGCAAGACCC 200 

3 0 TTCTTGAGGC CATTGACGSC ATTGAGCCCC CCAMSCGTCC TWCCGATAAA 250 

CCTCTCCGTC TTCCTCTCCA GGATGTCTAC AAGATCTCCG GTATTGGMAC 300 

TGTTCCTGTC GGACGTRTTG AGACTGGAGT CATCAAGCCC GGTATGGTCG 350 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAAATG 400 

CACCACCAGC AGCTTTCCGA CGGTAWCCCC GGTGACAACG TCGGCTTCAA 450 

35 CGTCAAGAAT GTTTCCGTCA AAGAAGTCCG CCGTGGTAAC GTTGCCTGGT 500 

GACTCTAAGA ATGATCCCGC MAWGGGCTGC GATTCCTTCA ATGCYCAGGT 550 

CATCGTCCTC AACCACCCTG GTCAGGTTGG CGCTGGTTAT GCCCCAGTCC 600 

TCGAYTGCCA TACTGCCCAC ATTGCYTGCA ARTTCGCTGA GMTCMAKGAG 650 

AAGATTGAYC GCCGAACCGG MAAGTCTGTT GAGAACGCCC CCAAGTTCAT 700 

40 CAAGTCCGGT GATGC 715 



45 



2) INFORMATION FOR SEQ ID NO: 1267 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



684 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidls 

(B) STRAIN: ATCC 56220 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1267 

GAGTCCTCTT ATTTACTTTT GTCATGACTA CCTTACTAAT CTGTCATAGA 50 

TCGTTACAAC GAAATCGTCA AGGAGACTTC CAACTTCATC AAGAAGGTCG 100 

GATACAACCC CAAGAACGTT CCTTTCGTTC CTATCTCCGG TTTCAACGGC 150 

10 GACAACATGC TTGAGCCCTC CCCCAACTGC CCCTGGTACA AGGGTTGGGA 200 

GAAGGAGACC AAGGCCGGTA AGGTCACTGG TAAGACCCTC CTCGAGGCCA 250 

TCGACGCCAT TGAGCCCCCT ACCCGTCCCG CCAACAAGGT CAGTACTACC 300 

TCAATTACTT GAACTCTCTT CATACGTTCC GATTACTGAC TGCTTCACAG 350 

CCCCTCCGTC TTCCCCTCCA GGACGTTTAC AAGATCGGTG GTATTGGAAC 4 00 

15 GGTGCCCGTC GGTCGTGTTG AGACCGGTAC CATCTCCCCT GGTATGGTCG 450 

TTACCTTGTA TGTATCCTGA CCATCCCCCT TGGCAATCAT TACGTACTAA 500 

CTCACTCTTC AGCGCTCCCG CGAACGTCAC CACTGAAGTC AAGAGTGTTG 550 

AAATGCACCA CCAGCAGCTC GCTGCCGGTC AGCCCGGTGA CAACGTTGGT 600 

TTCAACGTGA AGAACGTCTC CGTCAAGGAA ATCCGTCGTG GTAACGTTGC 650 

20 TGGTGATAGC AAGAACGACC CCCCTGCCGG TGCTGCTTCC TTCAACGCCC 700 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGTGCTGG TTACGCCCCA 750 

GTCCTTGACT GCCACACTGC CCACATTGCT TGCAAGTTCT CTGAACTCCT 800 

TGAGAAGATT GACCGTCGTA CCGGAAAGTC TGTTGAGGAC CACCCCAAGT 850 

TCATCAAGTC CGGTGACGCT GCCAT 875 

25 



2) INFORMATION FOR SEQ ID NO: 1268 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STI^NDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 
40 (B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268 

GTGAGCGTGG TATCACCATC GATATTGCCC TCTGGAAATT CGAGACCCCG 50 

45 AAGTACAGTG TCACTGTCAT TGGTGAGTGC TTTTTACCCC TCTTAAGCAG 100 

ATTTCAACTT CCAGAGTATC TACTCTAACA TATCCGCTTA GATGCTCCCG 150 

GCCATCGTGA CTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCTGAC 200 

■ TGCGCTATCC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC TCTGCTTGCT TTCACCCTTG 300 

50 GTGTGAGGCA ACTCATCGTT GCCATCAACA AGATGGACAC CACCAAGTGG 350 

TCCGAGTCCC GTTTCAACGA AATCATCAAG GAGGTTTCCA ACTTCATCAA 400 

GAAGGTCGGA TATAACCCCA AGGCTGTTCC CTTCGTGCCA ATCTCTGGTT 450 

TCGAGGGTGA CAACATGATT GAACCCTCCC CCAACTGCAC CTGGTACAAG 500 
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GGCTGGAACA 
CGATGCCATT 
TCCGTCTTCC 
CCCGTCGGAC 
TTTCGCTCCC 
ACCAACAACT 
AAGAACGTTT 
CAAAAATGAT 
TCCTTAACCA 
TGCCACACTG 
CGACCGCCGT 
CTGGTGATGC 
GAGCCCTTCA 



AGGAGACTGC 
GACGCCATTG 
CCTCCAGGAT 
GTGTTGAGAC 
TCCAACGTCA 
CCAGGCTGGT 
CAGTCAAGGA 
CCCCCCAAGG 
CCCCGGCCAG 
CCCACATTGC 
ACTGGAAAGT 
TGCTATCGTC 
CTGACTATCC 



CTCTGGCAAG 
AACCCCCAAC 
GTTTACAAAA 
TGGTGTCATC 
CCACTGAAGT 
TACCCTGGTG 
AGTCCGCCGT 
GCTGCGAATC 
GTTGGCGCTG 
TTGCAAGTTC 
CTGTTGAGAA 
AAGATGGTTC 
CCCT 



TCTTCTGGTA AAACCCTTCT 
CCGTCCTACC GATAAGCCTC 
TCTCTGGTAT TGGCACTGTT 
AAGCCCGGTA TGGTCGTGAC 
CAAGTCCGTC GAGATGCACC 
ACAACGTCGG CTTCAACGTC 
GGCAACGTTG CTGGCGACTC 
CTTCAATGCC CAGGTCATCG 
GTTATGCCCC AGTCCTCGAC 
TCTGAACTCA TTGAGAAGAT 
CAACCCCAAG TTCATCAAGT 
CCTCCi\AGCC CATGTGCGTG 



550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1124 



2) INFORMATION FOR SEQ ID NO: 1269 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1043 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton rub rum 

(B) STRAIN: WSA-224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1269 

GTGAGCGTGG TATCACCATC GATATCGCCC TCTGGAAGTT CGAGACCCCC 50 

AAGTACAATG TCACCGTCAT TGGTATGTTT CTTTGCCTTG TTCCCTCATG 100 

TGGTTGTACC ATATCTAACG AGAGTAGACG CCCCCGGTCA CCGTGACTTC 150 

ATCAAGAACA TGATCACTGG TACCTCCCAG GCTGACTGCG CTATTCTCAT 200 

CATTGCTGCC GGTACTGGTG AGTTCGAGGC TGGTATCTCC AAGGATGGCC 250 

AGACCCGTGA GCACGCTCTG CTCGCCTTCA CCCTCGGTGT CAAGCAGCTC 300 

ATCGTTGCCA TCAACAAGAT GGACACCACC GGCTGGTCCG AGGATCGTTT 350 

CAAGGAAATT ATCAAGGAAG TCACCAACTT CATCAAGAAG GTTGGCTACG 4 00 

ACCCCAAGGG TGTTCCATTC GTTCCAATCT CTGGTTTCAA CGGTGACAAC 4 50 

ATGATTGAGG CCTCCACCAA CTGCCCATGG TACAAGGGAT GGAACAAGGA 500 

GACCAAGGCC GGTGGTGCCA AGTCCGGCAA GACCCTCCTC GAGGCCATCG 550 

ATGCCATCGA CATGCCAACC CGTCCTACCG ACAAGCCCCT CCGTCTCCCA 600 

CTCCAGGATG TCTACAAGAT CTCTGGTATC GGAACTGTGC CAGTCGGTCG 650 

TGTTGAGACC GGTATCATCA AGCCCGGTAT GGTCGTCACC TTNGCCCCCG 7 00 

CCAACGTCAC CACTGAAGTC AAGTCCGTYK AAATGCACCA CCAGCAGCTT 750 

CAGCAGGGTG TCCCCGGTGA CAACGTCGGC TTCAATGTCA AGAACGTTTC 800 

CGTCAAGGAA GTCCGCCGTG GTAACGTTGC CGGTGACTCC AAGAACGACC 850 

CACCATCCGG CTGTGCCTCC TTCAACGCCC AGGTCATYGT CCTCAACCAC 900 

CCCGGCCAGA TCGGTGCTGG TTACGSTCCA GTCCTCGACT GCCACACTGS 950 

TCACATTGCT TGCAAGTTCG CTGAGCTCCT CGAGAAGATT GACCGCCGTA 1000 

CCGGTAAATC CGTCGAAGCC AACCCCAAGT TCGTCAAGTC TGG 1043 
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2) INFORMATION FOR SEQ ID NO: 1270 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum canis 

(B) STRAIN: WSA-217 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270 

GCTGAGCGTG AGCGTGGTAT CACCATTGAT ATCGCCCTCT GGAAGTTCGA 50 

GACCCCCAAG TACATGGTCA CCGTCATCGG TATGCTTTAT CTGTTTCCCA 100 

TTTATAGTTG CGACCAGTAA CTAACAAAAA GTAGATGCCC CCGGGCACCG 150 

20 TGACTTCATC AAGAACATGA TTACTGGTAC CTCCCAGGCC GACTGCGCTA 200 

TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CTCGTGAGCA CGCCCTGCTC GCTTTCACCC TCGGTGTCAA 300 

GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC TGGTCTGAGT 350 

CCCGTTTCGG TGAAATCATC AAGGAAGTCA CCAACTTCAT CAAGAAGGTC 4 00 

25 GGCTACGACC CCAAGGGTGT CCCATTCGTC CCAATCTCTG GCTTCAACGG 450 

TGACAACATG ATTGAGCCCT CCACCAACTG CCCATGGTAC AAGGGATGGA 500 

ACAAGGAGAC CAAGGCCGGT GGCAAATCCT CTGGTAAGAC CCTCCTTGAG 550 

GCCATCGATG CCATTGACAT GCCCACTCGT CCCACCGACA AGCCTCTCCG 600 

TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA ACAGTACCAG 650 

3 0 TCGGTCGTGT TGAGACTGGT ATCATCAAGC CTGGTATGGT TGTCACTTTY 700 

GCCCCCGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA TGCACCACCA 750 

GCAGCTYGTC CAGGGTGTTC CCGGTGACAA CGTTGGCTTC AACGTCAAGA 800 

ACGTYTCTGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG TGATTCCAAG 850 

AACGACCCAC CAGCTGGCTG CGCCTCTTTC AAGGCCCAGG TCATCGTCCT 900 

35 CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC CTTGACTGCC 950 

ACACTGCCCA CATTGCTTGC AAGTTCTCTG AGCTTCTTGA GAAGATTGAC 1000 

CGCCGTACTG GTAAATCCGT CGAAACCAGC CCTAAGTTCG TCAAGTCTGG 1050 

TGATGCCGCT ATTGCCACCA TGGTTCCATC CAAGCCCATG TGCGTTGAGG 1100 

CTTTC 1105 

40 



2) INFORMATION FOR SEQ ID NO: 1271 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271 

5 

GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA AGTTCCAGAC 50 

CCCTAAGTAT GAGGTCACCG TCATTGGTAT GTTGTCCTTC TTGTGTTACC 100 

ATCGAAACAT ATCTAACCTA CAACTGCAGA CGCCCCCGGT CACCGTGACT 150 

TCATCAAGAA CATGATCACT GGTACCTCCC AGGCCGACTG CGCTATTCTC 200 

10 ATCATTGCTT CCGGTACTGG TGAATTCGAG GCTGGTATCT CCAAGGATGG 250 

CCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTCGGT GTCCGTCAGC 300 

TCATCGTTGC CCTCAACAAG ATGGACACTG CTGGCTGGGC TGAGGCTCGT 350 

TACAACGAAA TCGTCAAGGA AACTTCCGGT TTCATCAAGA AGGTCGGCTA 400 

CAACCCCAAG TCGGTTCCCT TCGTCCCCAT CTCCGGTTTC AACGGTGACA 450 

15 ACATGCTTGA GCCCTCCTCC AACTGCCCCT GGTACAAGGG TTGGGAGAAG 500 

GAGACCAAGG CTGGTAAGGC CACTGGTAAG ACCCTCCTCG AGGCCATCGA 550 

CGCCATTGAG CCTCCCGTCC GTCCCTCCAA CAAGCCTCTC CGTCTTCCCC 600 

TCCAGGATGT CTACAAGATC TCTGGTATTG GAACTGTCCC CGTCGGCCGT 650 

GTCGAGACCG GTACCATCGT CCCCGGTATG GTCGTCACCT TCGCTCCCGC 700 

20 CAACGTCACC ACTGAAGTCA AGTCCGTTGA GATGCACCAC CAGCAGCTCA 750 

AGGAGGGTGT TCCCGGTKAC AACGTTGGTT TCAACGTGAA GAACGTTTCC 800 

GTCAAGGAAG TCCGCCGTGG TAACGTCGCT GGTGACTCCA AGAACGACCC 850 

CCCTGCCGGT GCTGCCTCTT TCACCGCCCA GGTCATCGTC CTCAACCACC 900 

CCGGTCAGGT CGGCGCTGGT TACGCTCCCG TCCTCGACTG CCACACCGCT 950 

25 CACATTGCCT GCAAGTTCGC TGAGCTCCAG GAGAAGATCG ACCGCCGTAC 1000 

CGGAAAGTCT GTCGAATYTG CCCCCAAGTT CATCAAGTCT GGTGACGCCG 1050 

CTATCGTCAA GATGATTCCC TCCAAGCCCA TGTGTGTCGA GTCTTTCACT 1100 

GACTACCCTC CTYTCGGCCG TTTCGCCGTC CGTGACGTAA GTTCTTTCCC 1150 

CAGCTTTTCG ATGCTACCCT TCTMTGAATC ACGTGTCATG TCTTGGCACC 1200 

30 CGCCCATCAC ATGACCACGC AACCCTATAC CCCGCCACAC CCTT 1244 



2) INFORMATION FOR SEQ ID NO: 1272 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Exophiala. moniliae 

(B) STRAIN: WSA-219 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272 

50 GCTGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT GGAAGTTCGA 50 

GACCCCC7VAG TACTATGTCA CCGTCATCGA CGCCCCCGGT CATCGTGACT 100 

TCATCAAGAA CATGATCACT GGTACTTCCC AAGCTGACTG CGCCATTCTC 150 

ATCATTGCTG CCGGTACTGG TGAATTCGAA GCCGGTATCT CCAAGGATGG 200 
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TCAGACCCGT GAGCACGCTC TGCTTGCCTA 
TCATTGTCGC CATCAACAAG ATGGACACTA 
TTCAACGAAA TCATCAAGGA GACTTCCAGC 
CAACCCCAAG TCCGTTCCTT TCGTCCCCAT 
5 ACATGATCGA CGTCTCCACC AACTGCCCCT 
GAGACCAAGG CTGGCAAGGC CTCTGGCAAG 
CGCCATTGAC CCCCCCTCTC GTCCCACCGA 
TCCAGGATGT GTACAAGATC TCTGGTATCG 
GTCGAGACTG GTATCATCAA GGCCGGTATG 

10 CAACGTCACC ACTGAAGTCA AGTCCGTCGA 
CCGAGGGTGT TCCAGGTGAC AACGTCGGTT 
GTCAAGGAGG TTCGTCGTGG AAACGTTTGC 
ACCCAAGGGC GCTGATTCCT TCAACGCCCA 
CTGGTCAAGT TGGTGCTGGC TACGCCCCAG 

15 CACATTGCTT GCAAGTTCTC TGAGCTTCTC 
CGGAAAGTCG ATCGAAAACA ACCCCAAGTT 
CCATCGTCAA GATGGTTCCC AGCAAGCCCA 



CACCCTGGGT GTCAAGGAGG 250 

CCAAGTGGTC TGAGGACCGT 300 

TTCATCAAGA AGGTCGGCTA 350 

CTCCGGCTTC AACGGTGACA 400 

GGTACAAGGG CTGGGAGAAG 4 50 

ACTCTCCTTG AGGCCATCGA 500 

CAAGCCTYTC CGTCTCCCTC 550 

GAACGGTGCC CGTCGGTCGT 600 

GTCGTTACCT TCGCTCCTGC 650 

AATGCACCAC GAACAGCTCG 7 00 

TCAACGTCAA GAACGTYTCC 750 

GGTGACTCCA AGAACGACCC 8 00 

GGTCATCGTC TTGAACCACC 850 

TGTTGGATTG CCACACTGCC 900 

GAGAAGATCG ATCGTCGFCAC 950 

CATCAAGTCT GGTGACGCTG 1000 

TG 1032 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1273 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hortaea werneckii 

(B) STRAIN: ATCC 34944 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1273 

TAACAGCCCG TGCGACCGTC TCGTCCAATA ACTAATGATT CGCAGGTACT 50 

ATGTCGTGAG TATCCGGTCC TTTTTTGTTA ATTTACCAGA AATGACGAAA 100 

TTTTTGACTA ATTAACACAC TCAGACCGTC ATTGACGCCC CGGGTCACCG 150 

4 0 TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCC GACTGCGCTG 200 

TCCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA 300 

GCAGCTCATC GTCGCCATCA ATAAGATGGA CACCACCAAG TGGTCCGAGG 350 

AGCGTTACGG CGAGATCATC AAGGAGACCT CTGCCTTCAT CAAGAAGGTC 4 00 

45 GGTTTCAACC CGAAGCACGT CCCGTTCGTC CCGATCTCCG GTTTCAACGG 4 50 

TGACAACATG ATCGAGGCCT YCACCAACTG CCCGTGGTAC AAGGGCTGGG 500 

AGAAGGAGAC CAAGGCCAAG GTCACCGGCA AGACCCTYCT TGAGGCCATT 550 

GACAACATCG ACCCGCCGAG CCGTCCTTCC GACAAGCCGC TCCGTCTTCC 600 

CCTCCAGGAT GTCTACAAGA TCGGTGGTAT TGGGACAGTC CCAGTCGGCC 650 

50 GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTTAC CTTCGCTCCG 700 

GCTGGTGTCA CCACTGAAGT GAAGTCCGTT GARATGCACC ACGAGCAGCT 750 

CGCTGAGGGT YTGCCGGGTG ACAACGTCGG CTTCAACGTC AAGAACGTTT 800 

CCGTCAAGGA GATCCGTCGT GGCAACGTTG CTGGTGACAG CAAGGCTGAC 850 
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CCGCCGAAGG GCTGCGACAG CTTCAACGCC 

CCCTGGCCAG GTCGGTGCTG GTTACGCTCC 

CCCACATTGC CTGCAAGTTC GGCGAGCTCC 

TYTGGCAAGT CCATTGAAGC CTYGCCTAAG 

5 TGCCATYGTC AAGATGATTC CGTCCAAGCC 
CTGAGT 



PCT/CAOO/01150 

CAGGTCATCG TCCTGAACCA 900 
AGTCCTGGAC TGCCACACTG 950 
TCGAGAAGAT CGACCGTCGC 1000 
TACATCAAGT CTGGTGACGC 1050 
GATGTGCGTT GAGCCATTCA 1100 

1106 



10 2) INFORMATION FOR SEQ ID NO: 1274 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 



25 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274 



CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCCGCTACT ATGTCACCGT CATTGGTATG TTGCTGTCAC 100 

CTCTCTCACA CATGTCTCAC CACTAACAAT CAACAGACGC CCCCGGCCAC 150 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGACTGCGC 200 

30 CATTCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 250 

AGGATGGCCA GACCCGTGAG CACGCCCTGC TCGCCTACAC CCTCGGTGTC 300 

AAGAACCTCA TTGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 350 

GTCCCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 400 

TCGGCTACAA CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC 450 

35 GGCGACAACA TGCTTACTCC CTCCACCAAC TGCCCCTGGT ACAAGGGCTG 500 

GGAGCGTGAG ATCAAGTCCG GCAAGCTCAC TGGCAAGACC CTCCTCGAGG 550 

CCATTGACTC CATCGAGCCC CCCAAGCGTC CCGTCGACAA GCCCCTYCGA 600 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATTGGCA CGGTTCCCGT 650 

CGGCCGTATC GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTMG 700 

40 CCCCCTCCAA CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG 750 

CAGCTCTYTG AGGGTCTTCC CGGTGACAAC GTCGGCTTCA ACGTGAAGAA 800 

CGTYTCCGTC AAGGAGATCC GACGTGGCAA CGTCGCTGGT GACTCCAAGA 850 

ACGACCCCCC TYTGGGTGCC GCCTCTTTCA CCGCCCAGGT CATTGTCCTC 900 

AACCACCCTG GCCAGGTCGG TGCCGGTTAC GCCCCCGTTY TGGACTGCCA 950 

45 CACTGCCCAC ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC 1000 

GCCGAACTGG TAAGGCTGTT GAGTCCGCCC CCAAGTTCAT CAAGTCTGGT 1050 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1100 

TTTCACTGAC TACCCCCCT 1119 



2) INFORMATION FOR SEQ ID NO: 1275 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1275 



CTGAAGTCTG AGCGTGAGCG 
GTTCGAGACC CCCAAGTACA 
GTGATTTCAT CAAGAACATG 
ATTCTCATCA TTGCTGCCGG 
GGATGGCCAG ACTCGTGAGC 
AGCAGCTCAT CGTTGCCATC 
GCCCGTTACC AGGAGATCAT 
CGGCTACAAC CCCAAGCACG 
GAGACAACAT GATCGAGGTC 
GAGAAGGAGA CCAAGGCCAA 
TGACGCCATC GACCCTCCTT 
CCCTCCAGGA TGTCTACAAG 
CGTGTTGAGA CCGGTAAGAT 
CGCTGGTGTC ACCACCGAGG 
TCACCGAGGG TCTTCCCGGT 
TCCGTCAAGG AGATCCGTCG 
CCCCCCCAAG GGTTGTGACT 
ACCCTGGTCA GGTCGGTGCT 
GCCCACATTG CTTGCAAGTT 
AACCGGCAAG TCCGTTGAGG 
CCGCCATCGT CAAGATGGTT 
ACCGACTACC CTCCTYTCGG 
TCAATCATTC TTC 



TGGTATCACT ATCGATATCG 
TGGTCACCGT CATCGATGCC 
ATCACTGGTA CCTCCCAGGC 
TACTGGTGAG TTCGAGGCTG 
ACGCTCTCCT CGCCTACACC 
AACAAGATGG ACACCACCAA 
CAAGGAGACC TCCGGTTTCA 
TTCCCTTCGT CCCCATCTCC 
AGCTCCAACT GCCCTTGGTA 
GGCCACTGGC AAGACCCTCC 
CCCGTCCTAC CGACAAGCCC 
ATCGGTGGTA TTGGCACGGT 
CATGGGTGGT ATGGTTGTCA 
TCAAGTCCGT CGAGATGCAC 
GACAACGTCG GCTTCAACGT 
TGGC/yVCGTT GCCGGTGACT 
CCTTCAACGC CCAGGTCATC 
GGTTACGCAC CCGTCCTCGA 
YTCCGAGCTT GTTGAGAAGA 
CTGCCCCCAA GTTCATCAAG 
CCCTCCAAGC CCATGTGTGT 
TCGTTTCGCC GTCCGGTATG 



CCCTCTGGAA 


50 


CCCGGTCACC 


100 


TGACTGCGCC 


150 


GTATCTCCAA 


200 


CTTGGTGTCA 


250 


GTGGTCTGAG 


300 


TCAAGAAGGT 


350 


GGCTTCAACG 


400 


CAAGGGTTGG 


450 


TCGAGGCCAT 


500 


CTCCGTCTTC 


550 


GCCCGTCGGT 


600 


CCTTCGCCCC 


650 


CACGAGCAGC 


700 


CAAGAACGTY 


750 


CCAAGAACGA 


800 


GTCCTGAACC 


850 


CTGCCACACT 


900 


TTGACCGCAG 


950 


TCTGGTGACG 


1000 


TGAGGCTTTC 


1050 


TTTTTCTTCT 


1100 




1113 



2) INFORMATION FOR SEQ ID NO: 1276 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 14112 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1276 



GGTCTGAGAC 
AAGAAGGTCG 
TTTCGAGGGT 
AGGGCTGGAA 
CTCGATGCCA 
TCTCCGTCTT 
TTCCCGTTGG 
ACCTTCGCCC 
CCACCAGCAG 
TCAAGAACGT 
TCCAAGAACG 
CGTCCTCAAC 
ACTGCCACAC 
ATTGACCGCC 
GTCCGGTGAT 



CCGTTTCAAC 
GATACAACCC 
GACAACATGA 
CAAGGAGACT 
TTGACGCCAT 
CCCCTCCAGG 
ACGTGTCGAG 
CTGCCAACGT 
CTCCAGGCTG 
TTCCGTCAAG 
ACCCCCCCAA 
CACCCCGGCC 
TGCCCACATT 
GTACCGGAAA 
GCTGCTATCG 



GAAATTATCA AGGAAGTCAG CAACTTCATC 
C7VAGTCTGTT CCCTTCGTGC CAATCTCCGG 
TTGAGCCTTC CCCCAACTGC CCTTGGTACA 
GCTGCTGGCA AGGCCGCCGG TAAGACTCTT 
CGACCCCCCC GTCCGTCCTA CCGAGAAGCC 
ATGTGTACAA GATCTCCGGT ATTGGCACTG 
ACTGGTGTCA TCAAGCCTGG TATGGTCGTG 
CACCACTGAA GTCAAGTCCG TTGAAATGCA 
GTTACCCTGG TGACAACGTC GGCTTCAACG 
GAAGTCCGCC GTGGCAACGT TGCCGGTGAC 
GGGCTGCGAG TCCTTCAACG CCCAGGTCAT 
AGGTTGGCGC TGGTTATGCC CCAGTCCTCG 
GCTTGCAAGT TCTCTGAGCT CATCGAGAAG 
GTCTGTTGAG GACAACCCCA AGTTCATCAA 
TCAAGATGAT T 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
731 



2) INFORMATION FOR SEQ ID NO: 1277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophlala dermatitidis 

(B) STRAIN: ATCC 7 6088 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1277 

CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCCCTCTG GAAGTTCGAG 50 

ACCCCCAAGT ACTATGTCAC CGTCATCGAC GCCCCGGGTC ATCGTGACTT 100 

TATCAAGAAC ATGATCACTG GTACCTCGCA GGCCGACTGC GCCATCTTGA 150 

TCATTGCCGC CGGTACCGGT GAATTCGAAG CCGGTATCTC CAAGGATGGT 200 

CAGACCCGTG AGCACGCTCT GCTCGCCTAC ACCTTGGGTG TCAAGCAGCT 250 

CATCGTCGCC ATCAACAAGA TGGACACCAC CAAGTGGTCC GAGGAGCGTT 300 

TCAACGAAAT CATCAAGGAG ACTTCCAACT TCATCAAGAA GGTCGGCTAC 350 

AACCCCAAGG CCGTTCCTTT CGTCCCCATC TCCGGCTTCA ACGGTGACAA 400 

CATGATTGAG GTCTCCACCA ACTGCCCGTG GTACAAGGGA TGGGAGAAGG 450 

AGTCCAAGGC TGGCAAGGCC ACCGGCAAGA CCCTCCTCGA GGCCATTGAC 500 

GCCATCGACC CACCCACCCG TCCCACCGAC AAGCCTCTCC GTCTCCCTCT 550 

CCAGGATGTC TACAAGATCT CTGGTATCGG AACGGTTCCT GTCGGTCGTG 600 

TCGAGACCGG TACCATCAAG GCCGGTATGG TCGTCACCTT CGCTCCGGCC 650 

AACGTCACCA CTGAAGTCAA GTCCGTCGAA ATGCACCACG AGCAGCTCGC 700 

CGAGGGTYTG CCAGGTGACA ACGTTGGCTT CAACGTCAAG AACGTYTCCG 750 

TCAAGGAGGT TCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCG 800 
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CCCAAGGGTG CCGAGTCCTT CAACGCCCAG GTCATTGTCC TCAACCACCC 850 

TGGTCAGATC GGTGCCGGCT ACGCTCCAGT CTTGGATTGC CACACTGCCC 900 

ACATTGCTTG CAAGTTCGCC GAGTTGCTCG AGAAGATCGA CCGTCGTACC 950 

GGAAAGTCCA TCGAGAACAA CCCCAAGTTC ATCAAGTCTG GTGATGCTGC 1000 

5 CATCGTC/VAG ATGATTCCCA GCAAGCCCAT GTGTGTCGAG GCTTTC 104 6 



10 



50 



2) INFORMATION FOR SEQ ID NO: 1278 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Fusarium monili forme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278 

25 CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT TCGAGACTCC 50 

TCGCTACTAT GTCACCGTCA TTGGTATGTT GTCGCTCATA CCTCATCCTA 100 

CTTCCTCATA CTAACACATC ATTCAGACGC TCCCGGTCAC CGTGATTTCA 150 

TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGATTGCGC CATTCTCATC 200 

ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA AGGATGGCCA 250 

30 GACCCGTGAG CACGCTCTTC TTGCCTACAC CCTTGGTGTC AAGAACCTCA 300 

TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCTGA GGCCCGTTAC 350 

CAGGAGATCA TCAAGGAGAC CTCCTCTTTC ATCAAGAAGG TCGGCTACAA 4 00 

CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC GGTGACAACA 4 50 

TGCTTACCCC CTCCACCAAC TGCCCCTGGT ACAAGGGTTG GGAGCGTGAG 500 

35 ATCAAGTCCG GCAAGCTCTC CGGCAAGACC CTCCTCGAGG CCATTGACTC 550 

CATCGAGCCT CCCAAGCGTC CCGTTGACAA GCCCCTCCGT CTTCCCCTCC 600 

AGGATGTCTA CAAGATCGGT GGTATTGGAA CGGTTCCCGT CGGCCGTATT 650 

GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTCG CTCCCTCCAA 7 00 

CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG CAGYTCAGTK 750 

40 AGGGCCAGCC CGGTGACAAC GTTGGTTTCA ACGTGAAGAA CGTTTCCGTC 800 

AAGGACATCC GACGTGGTAA CGTCGCTGGT GACTCCAAGA ACGACCCCCC 850 

CCAGGGTGCC GCTTCTTTCA CCGCCCAGGT CATCGTCCTC AACCACCCCG 900 

GCCAGGTCGG TGCTGGTTAC GCTCCTGTCC TCGATTGCCA CACTGCCCAC 950 

ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC GCCGAACCGG 1000 

45 TAAGGCTACT GAGGCCGCTC CCAAGTTCAT CAAGTCTGGT GACTCCGCCA 1050 

TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GTGTCGAGGC TTTCACTGAC 1100 

TACCCTCCT 1109 



2) INFORMATION FOR SEQ ID NO: 1279 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 765 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 
10 (B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1279 

CCTGCAAGTG GTCTGAGGAC CGTTACAACG AAATCGTGAA GGAGACCTCC 50 

15 AACTTCATCA AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC 100 

CATCTCCGGT TTCAACGGTG ACAACATGCT TGAGCCTTCC CCCAACTGCC 150 

CCTGGTACAA GGGTTGGGAG AAGGAGGGCA AGTCCGGCAA GGTCACCGGT 200 

AAGACTCTCC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCTGC 250 

CAACAAGCCT CTCCGTCTTC CCCTCCAGGA TGTGTACAAG ATCTCTGGTA 300 

20 TCGGAACTGT CCCCGTCGGC CGTGTCGAGA CTGGTGTCAT CACCCCCGGC 350 

ATGGTTGTTA CCTTCGCTCC TTCCAACGTC ACCACTGAAG TGAAGTCCGT 400 

TGAGATGCAC CACCAGCAGC TCAAGGAGGG TCTCCCCGGT GACAACGTTG 4 50 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AGGTCCGTCG TGGTAACGTC 500 

GCTGGTGACT CCAAGAACGA CCCCCCTGCT GGCGCTGCCT CCTTCACCGC 550 

25 CCAGGTCATC GTTCTCAACC ACCCCGGTCA GGTCGGCGCT GGCTACGCCC 600 

CCGTCCTCGA CTGCCACACT GCCCACATTG CCTGCAAGTT CGCTGAGCTC 650 

CAGGAGAAGA TTGACCGCCG TACCGGAAAG TCTGTTGAGT CTTCTCCCAA 700 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATC CCTYCAAAGC 750 

CCATGTGCGT CGAAG ' 7 65 

30 



2) INFORMATION FOR SEQ ID NO: 1280 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
45 (B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280 

GCGTGGTATC ACCATCGACA TTGCCCTCTG GAAGTTCCAG ACTCCCAAGT 50 

50 ATGAGGTCAC TGTCATCGGT AAGCTCGACT CGCCCCGATA TGTTTTGGTG 100 

CTGTAGCTAA CACGATCTGA AGATGCCCCC GGTCACCGTG ACTTCATCAA 150 

GAACATGATC ACTGGTACCT CCCAGGCTGA CTGCGCTATC CTCATCATTG 200 

CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACC 250 
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CGTGAGCACG CTCTGCTGGC TTTCACCCTC GGTGTCAAGC AGCTCATCGT 300 

CGCCCTCAAC AAGATGGACA CCTGCAAGTG GTCCGAGGAT CGTTACAACG 350 

AAATTGTCAA GGAAACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 400 

AAGGCCGTTC CCTTCGTMCC CATCKCTGGC TTCAACGGTG ACAACATGCT 4 50 

5 TGAGCCCTCC TCCAACTGCC CCTGGTACAA GGGATGGGAG AAGGAGACCA 500 

AGGCCGGCAA GGTCACTGGT AAGACCCTCA TCGAGGCCAT YGACGCCATT 550 

GAGCCCCCTG TCCGTCCCTC CAACAAGCCC CTYCGTCTTC CCCTCCAGGA 600 

TGTYTACAAG ATCTCTGGTA TCGGAACGGT CCCTGTCGGC CGTGTCGAGA 650 

CCGGTATCAT CAAGCCCGGC ATGGTCGTCA CCTTTGCCCC CGCCAACGTC 700 

10 ACCACTGAAG TCAAGTCCGT CGAAATGCAC CACCAGCAGC TCCAGGAGGG 750 

TGTCCCCGGT GACAACGTCG GTTTCAACGT CAAGAACGTT TCCGTCAAGG 800 

AAGTCCSCCG TGGTAACGTT TGCGGTGACT CCAAGAACGA TCCCCCTCAG 850 

GGTGCTGCCT CCTTCAACGC CCAGGTCATC GTCCTCAACC ACCCCGGTCA 900 

GGTCGGCGCT GGTTACGCCC CCGTCCTCGA CTGCCACACT GCCCACATTG 950 

15 CTTGCAAGTT CTYTGAGCTG CTTGAGAAGA TTGACCGCCG TACCGGCAAG 1000 

TCTGTTGAGA ACAACCCCAA GTTCATCAAG TCCGGTGAWG CCGCCATCGT 1050 

GAAGATGGTT CCTTCCAAGC CCATGTGTGT CGAGTCCTTC ACTGACTACC 1100 

CCCCT 1105 

20 



2) INFORMATION FOR SEQ ID NO: 1281 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
(0) TOPOLOGY: Linear 



30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus laurentii 

(B) STRAIN: ATCC 44096 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281 



GCCGAGCGAG AGCGAGGTAT CACCATCGAC ATTGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAACGTCA CCGTCATTGA CGCCCCCGGA CACCGAGACT 100 

40 TCATCAAGAA CATGATCACC GGTACCTCCC AGGCCGACTG CGCCATCCTT 150 

ATCATTGCCA CCGGTATCGG AGAGTTCGAG GCCGGTATCT CTAAGGACGG 200 

TCAGACCCGA GAGCACGCTT TGCTCGCCTT CACCCTCGGT GTCAGGCAGC 2 50 

TCATCGTTGC TTGCAACAAG ATGGACACCT GCAAGTGGTC CGAGGACCGA 300 

TTCAACGAGA TCGTCAAGGA GACCAACGGT TTCATCAAGA AGGTCGGATA 350 

45 CAACCCCAAG GCTGTTCCCT TCGTCCCCAT CTCTGGATGG CACGGAGACA 4 00 

AGTGAGTGCC GTTCTTTGCG TTGAGCCCTC TTTGTCGCTC CCCCTCCCTC 4 50 

TCAAGTGGCG GCGGCGGTCT CCACCCACAA ATCGGGTGGC GAATCCGCCA 500 

CACCCACCAC TTCTCGCCAC CGAGTGTGGC ACTTCTTCCA ACTCCTCTTT 550 

CCACTCCTCC TCGTCTCGTC TCTTTTTTTC TCCGTTGTCT TTGACAAGGG 600 

50 GAGTGTGCTG ATAGTAAAGC ATGCTTGAGG AGACCACCAA CATGCCGTGG 650 

TACAAGGGAT GGACCAAGGA GACCAAGTCC GGTGTCGTTA AGGGTAAGAC 700 

CCTCCTCGAC GCCATCGACG CCATCGAGCC TCCTCAACGA CCCACCGACA 750 

AGCCCCTCCG ACTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATCGGT 800 
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ACGGTGCCCG TCGGCCGAGT CGAGACCGGT GTCATCAAGG CCGGGTAAGT 850 

CACGGGAGCC TGACGGCTGT TGTTGCCACA CCCAACTTAT ATCCAGTATG 900 

GTCGTCACCT TCGCTCCTAC CAACGTCACC ACTGAGGTTA AGTCCGTTGA 950 

GATGCACCAC GAGCAGATCC CTGAGGGTCT TCCCGGAGAC AACGTTGGTT 1000 

5 TCAACGTGTyV GAACGTTTCC ATCAAGGACA TCCGACGAGG AAACGTYTGC 1050 

TCCGACTCCA AGAACGACCC CGCTAAGGAG GCCGCTTCTT TCAACGCCCA 1100 

GGTCATTGTC CTCAACCACC CTGGACAGAT TGGTGCCGGT TACACCCCCG 1150 

TCCTCGACTG CCACACCGCC CACATTGCCT GCAAGTTCGC CGAGCTCATC 1200 

GAGAAGATCG ACCGACGAAC TGGTAAGACC ATGGAGGCCG CCCCCAAGTT 12 50 

10 CGTCAAGTCC GGAGACGCCG CCATTGTCAA GCTCGTTGCC CAGAAGCCCA 1300 

TCTGTGTCGA GTCTTACTCT GACTACCCTC CCCTTGGACG ATT 1343 



15 2) INFORMATION FOR SEQ ID NO: 1282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsia parva 

(B) STRAIN: ATCC 10784 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282 



TGGTCCGAGG CTCGTTTCAA CGAAATCATC AAGGAAGTCA CCAACTTCAT 50 

CAAGAAGGTC GGATACAACC CCAAGTCTGT TCCCTTCGTG CCAATCTCTG 100 

GTTTCGAGGG TGATAACATG ATTGAGCCCT CCACCAACTG CCCCTGGTAC 150 

AAGGGCTGGA CCAAGGAGAC CGCTGCCGGC AAGTCAACCG GTAAGACTCT 200 

35 TCTTGACGCC ATTGATGCCA TCGATCAACC CTCCCGTCCT ACCGACAAGC 250 

CCCTCCGTCT TCCCCTCCAG GATGTGTACA AGATCTCCGG TATTGGCACT 300 

GTTCCCGTTG GACGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTCGT 350 

GACCTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 4 00 

ACCACCAACA GCTCTTGGCT GGTAACCCCG GTGACAACGT CGGTTTCAAC 4 50 

40 GTCAAGAACG TTTCCGTCAA GGAAGTCCGC CGTGGCAACG TTGCTGGTGA 500 

CTCAT^GAAC GACCCCCCCA AGGGCTGCGA CTCCTTCAAC GCCCAGGTCA 550 

TCGTCCTCAA TCACCCCGGT CAAGTTGGCG CTGGTTATGC CCCAGTCCTC 600 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TCCTCGAGAA 650 

GATTGACCGC CGTACCGGAA AGTCCACTGA GAACAACCCC AAGTTCATCA 700 

45 AGTCTGGTGA CGCCGCTATC GTCAAGATGG TTCC 734 



2) INFORMATION FOR SEQ ID NO: 1283 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solanl 

(B) STRAIN: ATCC 62877 

* 10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1283 

CGTGAGCGTG GTATCACCAT CGACATTGCC CTCTGGAAGT TCGAGACTCC 50 

CCGCTACTAT GTCACCGTCA TTGGTATGTT GCTGTCACCT CTCTCACACA 100 

TGTCTCACCA CTAACAATCA ACAGACGCCC CCGGCCACCG TGACTTCATC 150 

15 AAGAACATGA TCACTGGTAC TTCCCAGGCC GACTGCGCCA TTCTCATCAT 200 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 250 

CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA GAACCTCATT 300 

GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTACCA 350 

GGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 400 

2 0 CCAAGGCTGT CGCTTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 450 

CTTACTCCCT CCACCAACTG CCCCTGGTAC T^GGGCTGGG AGCGTGAGAT 500 

CAAGTCCGGC AAGCTCACTG GCAAGACCCT CCTCGAGGCC ATTGACTCCA 550 

TCGAGCCCCC CAAGCGTCCC GTCGACAAGC CCCTCCGACT TCCCCTCCAG 600 

GATGTCTACA AGATCGGTGG TATTGGCACG GTTCCCGTCG GCCGTATCGA 650 

25 GACTGGTGTC ATCAAGCCCG GTATGGTCGT TACCTTCGCC CCCTCCAACG 700 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTYTGAG 750 

GGTCTTCCCG GTGACAACGT CGGCTTCAAC GTKAAGAACG TYTCCGTCAA 800 

GGAGATCCGA CGTGGCAACG TCGCTGGTGA CTCCAAGAAC GACCCCCCTY 850 

TGGGTGCCGC CTCTTTCACC GCCCAGGTCA TTGTCCTCAA CCACCCTGGC 900 

3 0 CAGGTCGGTG CCGGTTACGC CCCCGTTYTG GACTGCCACA CTGCCCACAT 950 

TGCCTGCAAG TTCGCCGAGA TCCAGGAGAA GATCGACCGC CGAACTGGTA 1000 

AGGCTGTTGA GTCCGCCCCC AAGTTCATCA AGTCTGGTGA CTCCGCCATC 1050 

GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGACTA 1100 

CCCCCCT 1107 

35 



2) INFORMATION FOR SEQ ID NO: 1284 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 
50 (B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284 
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GCTCAAGGCC GAGCGTGAGC GCGGTATCAC CATCGATATT GCTCTGTGGA 50 

AGTTCGAGAC CCCCAAGTAC TACGTCACCG TCATTGACGC CCCCGGTCAT 100 

CGCGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

5 AGGATGGCCA GACTCGTGAG CACGCTCTGC TCGCCTACAC CCTGGGTGTG 250 

CGGCAGCTGA TCGTCGCCAT CAACAAGATG GACACGGCCA AGTGGGCTGA 300 

GGCTCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGACT GTTGCCTTCG TCCCCATCTC GGGCTTCCAC 4 00 

GGCGACAACA TGCTTACTCC CTCGACCAAC TGCCCCTGGT ACAAGGGCTG 4 50 

10 GGAGAAGGAG GGCAAGAGCG GCAAGGTTAC CGGTAAGACT CTGCTGGACG 500 

CCATTGACGC CGTCGAGCCC CCCAAGCGCC CCACGGACAA GCCCCTGCGT 550 

CTGCCCCTCC AGGATGTCTA CAAGATCGGC GGTATCGGCA CTGTCCCTGT 600 

CGGCCGTATC GAGACTGGTG TCCTGAAGCC CGGCATGGTC GTCACCTTTG 650 

CCCCGTCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

15 CAGCTTGTTG AGGGTGTTCC CGGCGACAAC GTCGGCTTCA ACGTCAAGAA 7 50 

CGTYTCCGTC AAGGAGATCC GTYGTGGCAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCCCC CTCGGGCGCC GCCACCTTCA ACGCCCAGGT CATTGTCCTG 850 

AACCACCCCG GCCAGGTCGG CAACGGCTAC GCCCCGGTTY TGGACTGCCA 900 

CACCGCCCAC ATTGCCTGCA AGTTCACCGA GATCCTTGAG AAGATCGACC 950 

20 GCCGTACCGG CAAGTCGGTT GAGAACAACC CCAAGTTCAT CAAGTCGGGT 1000 

GACGCCGCCA TTGTCAAGCT GACGCCCTYG AAGCCCATGT GCGTT 1045 



25 2) INFORMATION FOR SEQ ID NO: 1285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 64 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA--176 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285 



TTGCCAGTGG TCCGAGGCCC GTTACAACGA AATCGTCAAG GAGACTTCCG 50 

GTTTCATCAA GAAGGTCGGA TACAACCCCA AGTCCGTTGC CTTCGTCCCC 100 

ATCTCCGGTT TCAACGGTGA CAACATGCTC GAGGCCTCTA CCAACTGCCC 150 

CTGGTACAAG GGTTGGGAGA AGGAGACCAA GGCCGGTAAG GCCACTGGTA 200 

45 AGACCCTCCT TGAGGCCATC GACGCCATTG AGCCCCCCAC CCGTCCCTCC 250 

AACAAGCCCC TCCGTCTTCC CCTCCAGGAT GTCTACAAGA TCTCCGGTAT 300 

TGGAACTGTC CCCGTCGGCC GTGTCGAGAC TGGTGTTATC ACCCCCGGCA 350 

TGGTCGTCAC CTTCGCTCCT GCCAACGTCA CCACTGAAGT CAAGTCCGTT 4 00 

GAGATGCACC ACCAGCAGCT CAAGGAGGGT GTCCCCGGTG ACAACGTCGG 4 50 

50 TTTCAACGTC AAGAACGTTT CCGTCAAGGA AATCCGTCGT GGTAACGTTG 500 

CCTCCGACTC CAAGAACGAC CCCGCCTCCG GCGCTGCCTC TTTCAACGCC 550 

CAGGTCATCG TTCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC 600 

CGTCCTCGAC TGCCACACCG CCCACATTGC TTGCAAGTTC TCTGAGCTTC 650 

698 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



TTGAGAAGAT TGACCGCCGT ACCGGAAAGG CTGTTGAAAC CAGCCCCAAG 
TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC CTTCCAAGCC 
CATGTGCGKT CCGA 



700 
750 
764 



2) INFORMATION FOR SEQ ID NO: 1286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 16264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286 

ACCATCGATA TCGCGCTCTG GAAGTTCGAG ACTCCCAAGT ACTTCGTCAC 50 

CGTCATCGAT GCCCCTGGTC ATCGTGACTT CATCAAGAAC ATGATCACTG 100 

GTACCTCCCA GGCTGATTGT GCTATTCTCA TCATTGCCGC TGGTACTGGT 150 

GAGTTCGAGG CCGGTATCTC CAAGGATGGC CAGACCCGTG AGCATGCTCT 200 

GCTCGCCTAC ACCCTGGGCG TGAAGCAGCT TATCGTCGCC ATCAACAAGA 250 

TGGACACCAC CAAATGGTCT GAGGATCGTT TCAACGAAAT CATCAAGGAG 300 

ACTTCCAACT TCATCAAGAA GGTCGGATAC AACCCCAAGT CCGTTCCATT 350 

CGTGCCCATC TCCGGCTTCA ACGGTGACAA CATGATCGAC GTCTCCACCA 400 

ATGCCCCCTG GTACAAGGGC TGGGAAAAGG AGTCCAAGGC TGGCAAGGCC 4 50 

ACCGGCAAGA CCCTCCTTGA GGCTATCGAC TCCATCGACC CTCCTGCTCG 500 

TCCCACCGAC AAGCCTCTCC GTCTCCCACT CCAGGATGTC TACAAGATTT 550 

CTGGTATCGG CACGGTGCCC GTCGGTCGTG TTGAGACTGG TACCATCAAG 600 

GCCGGTATGG TCGTCACCTT TGCCCCCGCC AACGTCACCA CTGAAGTCAA 650 

GTCCGTCGAA ATGCACCACG AACAGCTYGC CGAGGGCGTT CCGGGTGACA 700 

ACGTCGGCTT CAACGTCAAG AACGTYTCCG TGAAGGAGGT TCGTCGTGGA 750 

AACGTTGCTG GTGACTCCAA GAACGACCCC CCCAAGGGTG CCGACTCCTT 800 

CAACGCCCAG GTCATCGTCC TCAACCACCC TGGTCAGGTC GGTGCTGGCT 850 

ACGCCCCGGT CTTGGATTGC CACACTGCCC ACATTGCCTG CAAGTTCTYT 900 

GAGCTCCTCG AGAAGATCGA TCGTCGKACC GGCAAGTCCA TGGAAAACAA 950 

CCCCAAGTTC ATCAAGTCTG G 971 



2) INFORMATION FOR SEQ ID NO: 1287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 

(B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287 



GGTCTGAGGA CCGTTACCAG 
AAGAAGGTCG GCTACAACCC 
TTTCAACGGA GACAACATGA 
AGGGTTGGGA GAAGGAGACC 
GAGGCCATTG ACGCCATCGA 
CCGTCTTCCC CTCCAGGATG 
CCGTCGGTCG TGTCGAGACC 
TTCGCCCCCG CTGGTGTCAC 
CGAGCAGCTT ACCGAGGGTG 
AGAACGTCTC CGTCAAGGAG 
AAGAACGACC CCCCCAAGGG 
CCTCAACCAC CCTGGTCAGG 
GCCACACCGC CCACATTGCC 
GACCGCCGTA CCGGAAAGTC 
TGGTGACGCG GCCATCGTCA 



GAGATCATCA AGGAGACCTC 
CAAGCACGTT CCCTTCGTCC 
TCGAGGCCTC CAGCAACTGC 
AAGGCCAAGG CCACTGGTAA 
CCCTCCCAGC CGTCCTACCG 
TCTACAAGAT TGGTGGTATT 
GGTATCATCA AGGCCGGTAT 
CACTGAAGTC AAGTCCGTCG 
TCCCCGGTGA CAACGTCGGC 
ATCCGTCGTG GTAACGTTGC 
CTGCGAGTCT TTCAACGCTC 
TCGGTGCCGG TTACGCGCCA 
TGCAAGTTCT CTGAGCTCCT 
TGTCGAAGCC TCTCCCAAGT 
AGATGGTTCC CT 



CAACTTCATC 50 

CCATCTCCGG 100 

CCCTGGTACA 150 

GACCCTCCTT 200 

ACAAGCCCCT 250 

GGCACGGTTC 300 

GGTCGTCACC 350 

AGATGCACCA 4 00 

TTCAACGTCA 4 50 

CGGTGACTCC 500 

AGGTCATTGT 550 

GTCCTCGACT 600 

CGAGAAGATT 650 

TCATCAAGTC 7 00 

732 



2) INFORMATION FOR SEQ ID NO: 1288 



(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288 

AATGGATCCT GTATACGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARRARCAAT WGCAATCTTT ATTAACCAAA TTCGWGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCCA 337 



2) INFORMATION FOR SEQ ID NO: 1289 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 



15 



25 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1289 



/y^TGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT .CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

20 GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARAARCAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCC 336 



2) INFORMATION FOR SEQ ID NO: 1290 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1290 
GAYTAYGCIA TGISIGTIAT HGT 23 

2) INFORMATION FOR SEQ ID NO: 1291 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1291 
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23 



5 2) INFORMATION FOR SEQ ID NO: 1292 

(i) SEQUENCE CHAEIACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1292 

ARISCYTCIA RIATRTGIGC 20 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1293 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293 
ATGGCTGAAT TACCTCAATC 20 

2) INFORMATION FOR SEQ ID NO: 1294 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1294 
ATGATTGTTG TATATCTTCT TCAAC 25 

50 



2) INFORMATION FOR SEQ ID NO: 1295 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1295 
CAGAAAGTTT GAAGCGTTGT 



2) INFORMATION FOR SEQ ID NO: 1296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1296 
AACGATTCGT GAGTCAGATA 



2) INFORMATION FOR SEQ ID NO: 1297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1297 
CGGTCAACAT TGAGGAAGAG CT 



2 ) INFORMATION FOR SEQ ID NO: 1298 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1298 
ACGAAATCGA CCGTCTCTTT TTC 



23 



10 



15 



20 



25 



30 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1299 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2711 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: 601055 

(C) ACCESSION NUMBER: X71437 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 99 



ATGGCTGAAT 
AATGCGTGAA 
CATTGCCAGA 
TATGGATTAA 
AGCACGTATC 
CATCTATTTA 
TATCCGCTTG 
CGCAGCAGCA 
AACTGTTACG 
GATGGTAATG 
ATTAGCCAAT 
CACCACATAA 
AACCCTGATA 
TTTCCCAACT 
ATGAAACAGG 
GAACGTGGAG 
AGTGAATAAG 
AGAAAATTGA 
GGTGTGCGTG 
TTTAAATAAC 
ATATGATTGC 
GCGTTGGTAC 
GCAATATAAC 
TACGTATCGC 
TCAGATACAG 
TTCTGAAAAA 



TACCTCAATC 
TCATTTTTAG 
TGTTCGTGAC 
ATGAACAAGG 
GTTGGTGACG 
TGAAGCAATG 
TTGATGGCCA 
ATGCGTTATA 
TGATATTAAT 
AAAGAGAGCC 
GGTGCATCAG 
CTTAACAGAA 
TTTCAATTGC 
GCTGGACTTA 
TCGTGGTTCA 
GCGGACGTCA 
GCTCGTATGA 
CGGTATCACT 
TCGTTATTGA 
TTATACAAAC 
ACTTGTAAAT 
ATTATTTAGA 
TTACGTAAAG 
ACTTGACCAT 
ATAAAGTTGC 
CAAGCTCAAG 



AAGAATAAAT 
ATTATGCGAT 
GGTTTAAAAC 
TATGACACCG 
TAATGGGTAA 
GTACGTATGG 
AGGTAACTTT 
CTGAAGCGCG 
AAAGATACAA 
GTCAGTCTTA 
GTATCGCGGT 
TTAATCAATG 
TGAGTTAATG 
TTTTAGGTAA 
ATTCAAATGC 
ACGTATTGTT 
TTGAAAAAAT 
GATTTACGTG 
TGTGCGTAAG 
AAACACCTCT 
GGTAGACCGA 
GCATCAAAAG 
CTAAAGATCG 
ATCGATGAAA 
AATGGAAAGC 
CTATTTTAGA 



GAACGAAATA 
GAGTGTTATC 
CAGTACATCG 
GATAAATCAT 
ATATCACCCT 
CTCAAGATTT 
GGTTCAATGG 
TATGACTAAA 
TAGATTTTAT 
CCTGCTCGAT 
AGGTATGGCA 
GTGTACTTAG 
GAAGATATTG 
GAGTGGTATT 
GTTCTCGTGC 
GTCACTGAAA 
TGCAGAGCTC 
ATGAAACAAG 
GATGCAAATG 
TCAAACATCA 
AGCTTATTAA 
ACAGTTGTTA 
TGCCCATATT 
TTATTTCAAC 
TTGCAACAAC 
CATGCGTTTA 



TTACCAGTGA 50 

GTTGCTCGTG 100 

TCGTATACTA 150 

ATAAAAAATC 200 

CATGGTGACT 250 

CAGTTATCGT 300 

ATGGAGATGG 350 

ATCACACTTG 4 00 

CGATAACTAT 4 50 

TCCCTAATTT 500 

ACGAATATTC 550 

CTTAAGTAAG 600 

AAGGTCCTGA 650 

AGACGTGCAT 700 

AGTTATTGAA 750 

TTCCTTTCCA 800 

GTTCGTGACA 850 

TTTACGTACT 900 

CTAGTGTCAT 950 

TTTGGTGTGA 1000 

TTTAAAAGAA 1050 

GAAGACGTAC 1100 

TTAGAAGGGT 1150 

GATTCGTGAG 1200 

GCTTCAAACT 1250 

AGACGTCTAA 1300 
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CAGGTTTAGA GAGA/^CAAA ATTGAAGCTG AATATAATGA GTTATTAAAT 1350 

TATATTAGTG AATTAGAAGC CATCTTAGCT GATGAAGAAG TGTTATTACA 14 00 

GTTAGTTAGA GATGAATTGA CTGAAATTAG AGATCGTTTC GGTGATGAGC 14 50 

GTCGTACAGA AATTCAATTA GGTGGATTTG AAGACTTAGA GGACGAAGAC 1500 

5 TTAATTCCAG AAGAACAAAT AGTAATTACT TTGAGCCATA ATAACTACAT 1550 

TAAACGTTTG CCGGTATCTA CATATCGTGC TCAAAACCGT GGTGGTCGTG 1600 

GTGTTCAAGG TATGAATACA TTGGAAGAAG ATTTTGTCAG TCAATTGGTA 1650 

ACTTTAAGTA CACATGACCA TGTATTGTTC TTTACTAACA AAGGTCGTGT 1700 

ATACAAACTA AAAGGTTATG AAGTGCCTGA GTTATCAAGA CAGTCTAAAG 1750 

10 GTATTCCTGT AGTGAATGCT ATTGAACTTG GAAATGATGA AGTCATTAGT 1800 

ACAATGATTG CTGTTAT^GA CCTTGAAAGT GAAGACAACT TCTTAGTGTT 1850 

TGCAACTAAA CGTGGTGTTG TTAAACGTTC AGCATTAAGT AACTTCTCAA 1900 

GAATAAATAG AAATGGTAAG ATTGCGATTT CGTTCAGAGA AGATGATGAG 1950 

TTAATTGCAG TTCGTTTAAC AAGTGGTCAA GT^AGATATCT TGATTGGTAC 2000 

15 ATCACATGCA TCATTAATTC GATTCCCTGA ATCAACATTA CGTCCTTTAG 2050 

GCCGTACAGC AACGGGTGTG AAAGGTATTA CACTTCGTGA AGGTGACGAA 2100 

GTTGTAGGGC TTGATGTAGC TCATGCAAAC AGTGTTGATG AAGTATTAGT 2150 

AGTTACTGAA AATGGTTATG GTATVACGTAC GCCAGTTAAT GACTATCGTT 2200 

TATCAAATCG TGGTGGTAAA GGTATTAAAA CAGCTACGAT TACTGAGCGT 2250 

2 0 AATGGTAATG TTGTATGTAT CACTACAGTA ACTGGTGAAG AAGATTTAAT 2300 

GATTGTTACT AATGCCGGTG TCATTATTCG ACTAGATGTT GCAGATATTT 2350 

CTCAA/SlATGG TCGTGCAGCA CAAGGTGTTC GCTTAATTCG CTTAGGCGAT 24 00 

GATCAATTTG TTTCAACGGT TGCTAAAGTA AAAGAGGATG CAGATGAAGT 2450 

AAATGAAGAT GAACAATCTA CTGTATCTGA AGATGGTACT GAACAACAAC 2500 

25 GTGAAGCGGT TGTAAATGAT GAAACACCAG GAAATGCAAT TCATACTGAA 2 550 

GTGATTGATT CAGAAGAAAA TGATGAAGAT GGACGTATTG AAGTAAGACA 2 600 

AGATTTCATG GATCGTGTTG AAGAAGATAT ACAACAATCA TCAGATGAAG 2 650 

ATGAAGAATA A 2711 

30 



2) INFORMATION FOR SEQ ID NO: 1300 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2628 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

45 (C) ACCESSION NUMBER: X57174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1300 

ATGAGCGACC TTGCGAGAGA AATTACACCG GTCAACATTG AGGAAGAGCT 50 

50 GAAGAGCTCC TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 100 

TGCCAGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 150 

GCCATGAACG TACTAGGCAA TGACTGGAAC AAAGCCTATA AAAAATCTGC 200 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCATCCCCAT GGTGACTCGG 250 
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CGGTCTATGA CACGATCGTC CGCATGGCGC AGCCATTCTC GCTGCGTTAT 300 

ATGCTGGTAG ACGGTCAGGG TAACTTCGGT TCTATCGACG GCGACTCTGC 350 

GGCGGCAATG CGTTATACGG AAATCCGTCT GGCGAAAATT GCCCATGAAC 4 00 

TGATGGCCGA TCTCGT^AAAA GAGACGGTCG ATTTCGTTGA TAACTATGAC 450 

5 GGCACGGAAA AAATTCCGGA CGTCATGCCA ACCAAAATTC CTAACCTGCT 500 

GGTGAACGGT TCTTCCGGTA TCGCCGTAGG TATGGCAACC AACATCCCGC 550 

CGCACAACCT GACGGAAGTC ATCAACGGTT GTCTGGCGTA TATTGATGAT 600 

GAAGACATCA GCATTGAAGG GCTGATGGAA CACATCCCGG AGCCGGACTT 650 

CCCGACGGCG GCAATCATTA ACGGTCGTCG CGGTATTGAA GAAGCTTACC 700 

10 GTACCGGTCG CGGCAAGGTG TATATCCGCG CTCGCGCAGA AGTGGAAGTT 7 50 

GACGCCAAAA CCGGTCGTGA AACCATTATC GTCCACGAAA TTCCGTATCA 800 

GGTAAACAAA GCGCGCCTGA TCGAGAAGAT TGCGGAACTG GTAAAAGAAA 850 

AACGCGTGGA AGGCATCAGC GCGCTGCGTG ACGAGTCTGA CAAAGACGGT 900 

ATGCGCATCG TGATTGAAGT GAT^CGCGAT GCGGTCGGTG AAGTTGTGCT 950 

15 CAACAACCTC TACTCCCAGA CCCAGTTGCA GGTTTCTTTC GGTATCAACA 1000 

TGGTGGCATT GCACCATGGT CAGCCGAAGA TCATGAACCT GAAAGACATC 1050 

ATCGCGGCGT TTGTTCGTCA CCGCCGTGAA GTGGTGACCC GTCGTACTAT 1100 

TTTCGAACTG CGTAAAGCTC GCGATCGTGC TCATATCCTT GAAGCATTAG 1150 

CCGTGGCGCT GGCGAACATC GACCCGATCA TCGAACTGAT CCGTCATGCG 1200 

2 0 CCGACGCCTG CAGAAGCGAA AACTGCGCTG GTTGCTAATC CGTGGCAGCT 1250 

GGGCAACGTT GCCGCGATGC TCGAACGTGC TGGCGACGAT GCTGCGCGTC 1300 

CGGAATGGCT GGAGCCAGAG TTCGGCGTGC GTGATGGTCT GTACTACCTG 1350 

ACCGAACAGC AAGCTCAGGC GATTCTGGAT CTGCGTTTGC AGAAACTGAC 14 00 

CGGTCTTGAG CACGAAAAAC TGCTCGACGA ATACAAAGAG CTGCTGGATC 1450 

25 AGATCGCGGA ACTGTTGCGT ATTCTTGGTA GCGCCGATCG TCTGATGGAA 1500 

GTGATCCGTG AAGAGCTGGA GCTGGTTCGT GAACAGTTCG GTGACAAACG 1550 

TCGTACTGAA ATCACCGCCA ACAGCGCAGA CATCAACCTG GAAGATCTGA 1600 

TCACCCAGGA AGATGTGGTC GTGACGCTCT CTCACCAGGG CTACGTTAAG 1650 

TATCAGCCGC TTTCTGAATA CGAAGCGCAG CGTCGTGGCG GGAAAGGTAA 1700 

30 ATCTGCCGCA CGTATTAAAG AAGAAGACTT TATCGACCGA CTGCTGGTGG 17 50 

CGAACACTCA CGACCATATT CTGTGCTTCT CCAGCCGTGG TCGCGTCTAT 1800 

TCGATGAAAG TTTATCAGTT GCCGGAAGCC ACTCGTGGCG CGCGCGGTCG 18 50 

TCCGATCGTC AACCTGCTGC CGCTGGAGCA GGACGAACGT ATCACTGCGA 1900 

TCCTGCCAGT GACCGAGTTT GAAGAAGGCG TGAAAGTCTT CATGGCGACC 1950 

35 GCTAACGGTA CCGTGAAGAA AACTGTCCTC ACCGAGTTCA ACCGTCTGCG 2000 

TACCGCCGGT AAAGTGGCGA TCAAACTGGT TGACGGCGAT GAGCTGATCG 2050 

GCGTTGACCT GACCAGCGGC GAAGACGAAG TAATGCTGTT CTCCGCTGAA 2100 

GGTAAAGTGG TGCGCTTTAA AGAGTCTTCT GTCCGTGCGA TGGGCTGCAA 2150 

CACCACCGGT GTTCGCGGTA TTCGCTTAGG TGAAGGCGAT AAAGTCGTCT 2200 

40 CTCTGATCGT GCCTCGTGGC GATGGCGCAA TCCTCACCGC AACGCAAAAC 2250 

GGTTACGGTA AACGTACCGC AGTGGCGGAA TACCCAACCA AGTCGCGTGC 2300 

GACGAAAGGG GTTATCTCCA TCAAGGTTAC CGAACGTAAC GGTTTAGTTG 2350 

TTGGCGCGGT ACAGGTAGAT GACTGCGACC AGATCATGAT GATCACCGAT 2400 

GCCGGTACGC TGGTACGTAC TCGCGTTTCG GAAATCAGCA TCGTGGGCCG 24 50 

45 TAACACCCAG GGCGTGATCC TCATCCGTAC TGCGGAAGAT GAAAACGTAG 2500 

TGGGTCTGCA ACGTGTTGCT GAACCGGTTG ACGAGGAAGA TCTGGATACC 2550 

ATCGACGGCA GTGCCGCGGA AGGGGACGAT GAAATCGCTC CGGAAGTGGA 2600 

CGTTGACGAC GAGCCAGAAG AAGAATAA 2 628 

50 

2) INFORMATION FOR SEQ ID NO: 1301 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1301 
GTIMGIAWIM GICCIGSIAT GTA 



2) INFORMATION FOR SEQ ID NO: 1302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1302 
TAIADIGGIG GIKKIGCIAT RTA 



2) INFORMATION FOR SEQ ID NO: 1303 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1303 

GGIGAIGAID YIMGIGARGG 



2) INFORMATION FOR SEQ ID NO: 1304 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1304 
CIARYTTIKY ITTIGTYTG 



2) INFORMATION FOR SEQ ID NO: 1305 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1305 
ATGGTGACTG CATTGTCAGA TG 



2) INFORMATION FOR SEQ ID NO: 1306 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1306 
GTCTACGGTT TTCTACAACG TC 



2) INFORMATION FOR SEQ ID NO: 1307 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307 



5 ATGGTGACTG CATTGTCAGA TGTAAACAAC ACGGATAATT ATGGTGCTGG 50 

GCAAATACAA GTATTAGAAG GTTTAGAAGC AGTACGTAAA AGACCAGGTA 100 

TGTATATAGG ATCGACTCAG AGAGAGTTGC ACATTAGTGT GGAAATTGTC 150 

GATAATAGTA TCGATGAAGC ATTAGCTGGT TATGCAAATA Z^TTGAAGT 200 

TGTTATTGAA AAAGATAACT GGATTAAAGT AACGGATAAC GGACGTGGTA 250 

10 TCCCAGTTGA TATTCAAGAA AAAATGGGAC GTCCAGCTGT CGAAGTTATT 300 

TTAACTGTTT TACATGCTGG TGGTAAATTC GGCGGTGGCG GATACAAAGT 350 

ATCTGGTGGT TTACATGGTG TTGGTTCATC AGTTGTAAAC GCATTGTCAC 4 00 

AAGACTTAGA AGTATATGTA CACAGAAATG AGACTATATA TCATCAAGCA 4 50 

TATAAAi^AAG GTGTACCTCA ATTTGACTTA AAAGAAGTTG GCACAACTGA 500 

15 TAAGACAGGT ACTGTCATTC GTTTTAAAGC AGATGGAGAA ATCTTCACAG 550 

AGACAACTGT ATACAACTAT GAAACATTAC AGCAGCGTAT TAGAGAGCTT 600 

GCTTTCTTAA ACAAAGGAAT TCAAATCACA TTAAGAGATG AACGTGATGA 650 

AGAAAACGTT AGAGAAGACT CCTATCACTA TGAGGGCGGT ATTAAATCGT 700 

ACGTTGAGTT ATTGAACGAA AATAAAGAAC CTATTCATGA TGAGCCAATT 750 

20 TATATTCATC AATCT7U\AGA TGATATTGAA GTAGAAATTG CGATTCAATA 800 

TAACTCAGGA TATGCCACAA ATCTTTTAAC TTACGCAAAT AACATTCATA 850 

CGTACGAAGG TGGTACGCAT GAAGACGGAT TCAAACGTGC ATTAACGCGT 900 

GTCTTAAATA GTTATGGTTT AAGTAGCAGA TATGAAGAAG AAAAGATAGC 950 

TTCTGGTGAA GATACACGAG AAGGTATGAC AGCAATTATA TCTATCAAAC 1000 

25 ATGGTGATCC TCAATTCGAA GGTCAAACGA AGACAAAATT AGGTAATTCT 1050 

GAAGTGCGTC AAGTTGTAGA TAAATTATTC TCAGAGCACT TTGAACGATT 1100 

TTTATATGAA AATCCACAAG TCGCACGTAC AGTGGTTGAA AAAGGTATTA 1150 

TGGCGGCACG TGCACGTGTT GCTGCGAAAA AAGCGCGTGA AGTAACACGT 1200 

CGTAAATCAG CGTTAGATGT AGCAAGTCTT CCAGGTAAAT TAGCCGATTG 1250 

30 CTCTAGTCAA AGTCCTGAAG AATGTGAGAT TTTCTTAGTC GAAGGGGACT 1300 

CTGCCGGAGG GTCTACAAAA TCTGGTCGTG ACTCTAGAAC GCAGGCGATT 1350 

TTACCATTAC GAGGTAAGAT ATTAAATGTT GAAAAAGCAC GATTAGATAG 14 00 

AATTTTGAAT AACAATGAAA TTCGTCAAAT GATCACAGCA TTTGGTACAG 14 50 

GAATCGGTGG CGACTTTGAT CTAGCGAAAG CAAGATATCA CAAAATCGTC 1500 

35 ATTATGACTG ATGCCGATGT GGATGGAGCG CATATTAGAA CATTGTTATT 1550 

AATATTCTTC TATCGATTTA TGAGACCGTT AATTGAAGCA GGCTATGTGT 1600 

ATATTGCACA GCCACCGTTG TATAAACTGA CACAAGGTAA ACAAAAGTAT 1650 

TATGTATACA ATGATAGGGA ACTTGATAAA CTTAAATCTG AATTGAATCC 1700 

AACACCAAAA TGGTCTATTG CGCTATACAA AGGTCTTGGA GAAATGAATG 1750 

40 CAGATCAATT ATGGGAAACA ACAATGAACC CTGAGCACCG CGCTCTTTTA 1800 

CAAGTAAAAC TTGAAGATGC GATTGAAGCG GACCAAACAT TTGAAATGTT 1850 

AATGGGTGAC GTTGTAGAAA ACCGTAGACA ATTTATAGAA GATAATGCAG 1900 

TTTATGCAAA CTTAGACTTC TAA 1923 

45 

2) INFORMATION FOR SEQ ID NO: 1308 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308 
ATGTAYGTIA TIATGGAYMG IGC 



2) INFORMATION FOR SEQ ID NO: 1309 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309 
ATIATYTTRT TICCYTTICC YTT 



2) INFORMATION FOR SEQ ID NO: 1310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310 
ATIATITSIA TIACYTCRTC 



2) INFORMATION FOR SEQ ID NO: 1311 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311 
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23 



5 2) INFORMATION FOR SEQ ID NO: 1312 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1312 

AARTAYATIA TICARGARMG IGC 23 



20 



30 



35 



2 ) INFORMATION FOR SEQ ID NO: 1313 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1313 
AMIAYICKRT GIGGITTITT YTT 23 

2) INFORMATION FOR SEQ ID NO: 1314 



(i) SEQUENCE CHAIUVCTERISTICS : 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314 
TAIGAITTYA CIGAISMICA RGC 23 

50 

2) INFORMATION FOR SEQ ID NO: 1315 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1315 
ACIATIGCIT CIGCYTGIKS YTC 



2) INFORMATION FOR SEQ ID NO: 1316 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: . Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316 
GTGAGTGAAA TAATTCAAGA TT 



2) INFORMATION FOR SEQ ID NO: 1317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317 
CACCAAAATC ATCTGTATCT AC 



2) INFORMATION FOR SEQ ID NO: 1318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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10 



20 



25 



(ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318 
ACCTAYTCSA TGTACGTRAT CATGGA 26 

2) INFORMATION FOR SEQ ID NO: 1319 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319 
AGRTCGTCIA CCATCGGYAG YTT 23 

2) INFORMATION FOR SEQ ID NO: 1320 



(i> SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K-12 MG1655 

40 (C) ACCESSION NUMBER: AE000384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320 

ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC 50 

45 CTACTTAAAC TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA 100 

TTGGTGATGG TCTGAAACCT GTTCAGCGCC GCATTGTGTA TGCGATGTCT 150 

GAACTGGGCC TGAATGCCAG CGCCAAATTT AAAAAATCGG CCCGTACCGT 200 

CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATAGC GCCTGTTATG 250 

AAGCGATGGT CCTGATGGCG CAACCGTTCT CTTACCGTTA TCCGCTGGTT 300 

50 GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC 350 

AATGCGTTAC ACCGAATCCC GGTTGTCGAA ATATTCCGAG CTGCTATTGA 4 00 

GCGAGCTGGG GCAGGGGACG GCTGACTGGG TGCCAAACTT CGACGGCACT 450 

TTGCAGGAGC CGAAAATGCT ACCTGCCCGT CTGCCAAACA TTTTGCTTAA 500 
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CGGCACCACC GGTATTGCCG TCGGCATGGC GACCGATATT CCACCGCATA 
ACCTGCGTGA AGTGGCTCAG GCGGCAATCG CATTAATCGA CCAGCCGAAA 
ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGGCCGG ATTATCCGAC 
TGAAGCGGAA ATTATCACTT CGCGCGCCGA GATCCGTAAA ATCTACGAGA 
5 ACGGACGTGG TTCAGTGCGT ATGCGCGCGG TGTGGAAGAA AGAAGATGGC 
GCGGTGGTTA TCAGCGCATT GCCGCATCAG GTTTCAGGTG CGCGCGTACT 
GGAGCAAATT GCTGCGCAAA TGCGCAACAA AAAGCTGCCG ATGGTTGACG 
ATCTGCGCGA TGAATCTGAC CACGAGAACC CGACCCGCCT GGTGATTGTG 
CCGCGTTCCA ACCGCGTGGA TATGGATCAG GTGATGAACC ACCTCTTCGC 

10 TACCACCGAT CTGGAAAAGA GCTATCGTAT TAACCTTAAT ATGATCGGTC 
TGGATGGTCG TCCGGCGGTG AAAAACCTGC TGGAAATCCT CTCCGAATGG 
CTGGTGTTCC GCCGCGATAC CGTGCGCCGC CGACTGAACT ATCGTCTGGA 
GAAAGTCCTC AAGCGCCTGC ATATCCTCGA AGGTTTGCTG GTGGCGTTTC 
TCAATATCGA CGAAGTGATT GAGATCATTC GTAATGAAGA TGAACCGAAA 

15 CCGGCGCTGA TGTCGCGGTT TGGCCTTACG GAAACCCAGG CGGAAGCGAT 
CCTCGAACTG AAACTGCGTC ATCTTGCCAA ACTGGAAGAG ATGAAGATTC 
GCGGTGAGCA GAGTGAACTG GAAAAAGAGC GCGACCAGTT GCAGGGCATT 
TTGGCTTCCG AGCGTAAAAT GAATAACCTG CTGAAGAAAG AACTGCAGGC 
AGACGCGCAA GCCTACGGTG ACGATCGTCG TTCGCCGTTG CAGGAACGCG 

20 AAGAAGCGAA AGCGATGAGC GAGCACGACA TGCTGCCGTC TGAACCTGTC 
ACCATTGTGC TGTCGCAGAT GGGCTGGGTA CGCAGCGCTA AAGGCCATGA 
TATCGACGCG CCGGGCCTGA ATTATAAAGC GGGTGATAGC TTCAAAGCGG 
CGGTGAAAGG TAAGAGCAAC CAACCGGTAG TGTTTGTTGA TTCCACCGGT 
CGTAGCTATG CCATTGACCC GATTACGCTG CCGTCGGCGC GTGGTCAGGG 

25 CGAGCCGCTC ACCGGCAAAT TAACGTTGCC GCCTGGGGCG ACCGTTGACC 
ATATGCTGAT GGAAAGCGAC GATCAGAAAC TGCTGATGGC TTCCGATGCG 
GGTTACGGTT TCGTCTGCAC CTTTAACGAT CTGGTGGCGC GTAACCGTGC 
AGGTAAGGCT TTGATCACCT TACCGGAAAA TGCCCATGTT ATGCCGCCGG 
TGGTGATTGA AGATGCTTCC GATATGCTGC TGGCAATCAC TCAGGCAGGC 

30 CGTATGTTGA TGTTCCCGGT AAGTGATCTG CCGCAGCTGT CGAAGGGCAA 
AGGCAACAAG ATTATCAACA TTCCATCGGC AGAAGCCGCG CGTGGAGAAG 
ATGGTCTGGC GCAATTGTAC GTTCTGCCGC CGCAAAGCAC GCTGACCATT 
CATGTTGGGA AACGCAAAAT TAAACTGCGC CCGGAAGAGT TACAGAAAGT 
CACTGGCGAA CGTGGACGCC GCGGTACGTT GATGCGCGGT TTGCAGCGTA 

35 TCGATCGTGT TGAGATCGAC TCTCCTCGCC GTGCCAGCAG GGGTGATAGC 
GAAGAGTAA 



40 2) INFORMATION FOR SEQ ID NO: 1321 



550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1700 
1750 
1800 
1850 
1900 
1950 
2000 
2050 
2100 
2150 
2200 
2250 
2259 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2403 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) 



MOLECULE TYPE: Genomic DNA 



50 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: KMP9 

(C) ACCESSION NUMBER: D67074 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321 

GTGAGTGAAA TAATTCAAGA TTTATCACTT GAAGATGTTT TAGGTGATCG 50 

• 5 CTTTGGAAGA TATAGTAAAT ATATTATTCA AGAGCGTGCA TTGCCAGATG 100 

TTCGTGATGG TTTAAAACCA GTACT^CGTC GTATTTTATA TGCAATGTAT 150 

TCAAGTGGTA ATACACACGA TAAAAATTTC CGTAAAAGTG CGAAAACAGT 200 

CGGTGATGTT ATTGGTCAAT ATCATCCACA TGGAGACTTC TCAGTGTACA 250 

AAGCAATGGT CCGTTTAAGT CAAGACTGGA AGTTACGACA TGTCTTAATA 300 

10 GAAATGCATG GTAATAATGG TAGTATCGAT AATGATCCGC CAGCGGCAAT 350 

GCGTTACACT GAAGCTAAGT TAAGCTTACT AGCTGAAGAG TTATTACGTG 4 00 

ATATTAATAA AGAGACAGTT TCTTTCATTC CAAACTATGA TGATACGACA 450 

CTCGAACCAA TGGTATTGCC ATCAAGATTT CCTAACTTAC TAGTGAATGG 500 

TTCTACAGGT ATATCTGCAG GTTACGCGAC AGATATACCA CCACATAATT 55Q 

15 TAGCTGAAGT GATTCAAGCA ACACTTAAAT ATATTGATAA TCCGGATATT 600 

ACAGTCAATC AATTAATGAA ATATATTAAA GGTCCTGATT TTCCAACTGG 650 

TGGTATTATT CAAGGTATTG ATGGTATTAA AAAAGCTTAT GAATCAGGTA 700 

AAGGTAGAAT TATAGTTCGT TCTAAAGTTG AAGAAGAAAC TTTACGCAAT 750 

GGACGTAAAC AGTTAATTAT TACTGAAATT CCATATGAAG TGAACAAAAG 800 

20 TAGCTTAGTA AAACGTATCG ATGAATTACG TGCTGACAAA AAAGTCGATG 850 

GTATCGTTGA AGTACGTGAT GAAACTGATA GAACTGGTTT ACGAATAGCA 900 

ATTGAATTGA AAAAAGATGT GAACAGTGAA TCAATCAAAA ATTATCTTTA 950 

TAAAAACTCT GATTTACAGA TTTCATATAA TTTCAACATG GTCGCTATTA 1000 
GTGATGGTCG TCCAAAATTG ATGGGTATTC GTCAAATTAT AGATAGTTAT * 1050 

25 TTGAATCATC AAATTGAGGT TGTTGCAAAT AGAACGAAGT TTGAATTAGA 1100 

TAATGCTGAA AAACGTATGC ATATCGTTGA AGGTTTGATT AAAGCGTTGT 1150 

CAATTTTAGA TAAAGTAATT GAATTGATTC GTAGCTCTAA AAACT^GCGT 1200 

GACGCTAAAG AAAAGCTTAT CGAAGTATTC GAGTTCACAG AAGAACAGGC 1250 

TGAAGCAATT GTAATGTTAC AGTTATATCG TTTAACAAAC ACTGACATAG 1300 

30 TTGCGCTTGA AGGTGAACAT AAAGAACTTG AAGCATTAAT CAAACAATTA 1350 

CGTCATATTC TTGATAACCA TGATGCATTA TTGAATGTCA TAAAAGAAGA 1400 

ATTGAATGAA ATTAAAAAGA AATTCAAATC TGAACGACTG TCTTTAATTG 14 50 

AAGCAGAAAT TGAAGAAATT AAAATTGACA AAGAAGTTAT GGTGCCTAGT 1500 

GAAGAAGTTA TTTTAAGTAT GACACGTCAT GGATATATTA i\ACGTACTTC 1550 

35 TATTCGTAGC TTTAATGCTA GCGGTGTTGA GGATATTGGT TTAAAAGATG 1600 

GTGACAGTTT ACTTAAACAT CAAGAAGTAA ATACGCAAGA TACCGTACTA 1650 

GTATTTACAA ATAAAGGTCG TTATCTATTT ATACCGGTTC ATAAATTAGC 1700 

AGATATTCGT TGGAAAGAAT TGGGGCAACA TGTATCACAA ATAGTTCCTA 1750 

TCGAAGAAGA TGAAGTGGTT ATTAATGTCT TTAATGAAT^ GGACTTTAAT 1800 

40 ACAGATGCAT TTTATGTTTT TGCGACTCAA AATGGCATGA TTAAGAAAAG 1850 

TACAGTGCCT CTATTTAAAA CAACGCGTTT TAATAAACCT TTAATTGCTA 1900 

CTAAAGTTAA AGAAAATGAT GATTTGATTA GTGTTATGCG CTTTGAAAAA 1950 

GATCAATTAA TTACCGTCAT TACTAATAAA GGTATGTCAT T7y\CGTATAA 2000 

TACAAGTGAA CTATCAGATA CCGGATTAAG GGCAGCTGGT GTTAAATCAA 2050 

45 TAAATCTTAA AGCTGAAGAT TTCGTTGTTA TGACAGAAGG TGTTTCTGAA 2100 

AATGATACTA TATTGATGGC CACACAACGC GGCTCGTTAA AACGTATTAG 2150 

TTTTAAAATC TTACAAGTTG CTAAAAGAGC ACAACGTGGA ATAACTTTAT 2200 

TAAAAGAATT AAAGAAAAAT CCACATCGTA TTGTAGCTGC ACATGTAGTG 2250 

ACAGGTGAAC ATAGTCAATA TACATTATAT TCAAAATCAA ATGAAGAACA 2300 

50 TGGTTTAATT AATGATATTC ATAAATCTGA ACAATATACA AATGGCTCAT 2350 

TCATTGTAGA TACAGATGAT TTTGGTGAAG TAATAGACAT GTATATTAGC 24 00 

TAA 2403 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1322 
RTIGAIAAYI SIGTIGAYGA RG 



2) INFORMATION FOR SEQ ID NO: 1323 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323 
ACIAWRSAIG GIGGIACICA YG 



2) INFORMATION FOR SEQ ID NO: 1324 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1324 
CCICCIGCIS WRTCICCYTC 



2) INFORMATION FOR SEQ ID NO: 1325 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

716 



SUBSTITUTE SHEET (RULE 26) 



10 



45 



WO 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1325 
RTTCATYTCI CCIARICCYT T 21 



2) INFORMATION FOR SEQ ID NO: 1326 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326 
TGATTCAATA CAGGTTTTAG AG 22 

25 

2) INFORMATION FOR SEQ ID NO: 1327 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1327 
40 CTAGATTTCC TCCTCATCAA AT 22 



2) INFORMATION FOR SEQ ID NO: 1328 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1992 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN4220 

(C) ACCESSION NUMBER: D67075 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1328 

ATGAATAAAC AAAATAATTA TTCAGATGAT TCAATACAGG TTTTAGAGGG 50 

GTTAGAAGCA GTTCGTAAAA GACCTGGTAT GTATATTGGA TC7\ACTGATA 100 

10 AACGGGGATT ACATCATCTA GTATATGAAA TTGTCGATAA CTCCGTCGAT 150 

GAAGTATTGA ATGGTTACGG T7VACGAAATA GATGTAACAA TTAATAAAGA 200 

TGGTAGTATT TCTATAGAAG ATAATGGACG TGGTATGCCA ACAGGTATAC 250 

ATAAATCAGG TAAACCGACA GTCGAAGTTA TCTTTACTGT TTTACATGCA 300 

GGAGGTAAAT TTGGACAAGG CGGCTATAAA ACTTCAGGTG GTCTTCACGG 350 

15 TGTTGGTGCT TCAGTTGTAA ATGCATTGAG TGAATGGCTT GAAGTTGAAA 4 00 

TCCATCGAGA TGGTAATATA TATCATCAAA GTTTTAAAAA CGGTGGTTCG 4 50 

CCATCTTCTG GTTTAGTGAA AAAAGGTAAA ACTAAGAAAA CAGGTACCAA 500 

AGTAACATTT AAACCTGATG ACACAATTTT TAAAGCATCT ACATCATTTA 550 

ATTTTGATGT TTTAAGTGAA CGACTACAAG AGTCTGCGTT CTTATTGAAA 600 

20 AATTTAAAAA TAACGCTTAA TGATTTACGC AGTGGTAAAG AGCGTCAAGA 650 

GCATTACCAT TATGAAGAAG GAATCAAAGA GTTTGTTAGT TATGTCAATG 700 

AAGGAAAAGA AGTTTTGCAT GACGTGGCTA CATTTTCAGG TGAAGCAAAT 750 

GGTATAGAGG TAGACGTAGC TTTCCAATAT AATGATCAAT ATTCAGAAAG 8 00 

TATTTTAAGT TTTGTAAATA ATGTACGTAC TAAAGATGGT GGTACACATG 850 

25 AAGTTGGTTT TAAAACAGCA ATGACACGTG TATTTAATGA TTATGCACGT 900 

CGTATTAATG AACTTAAAAC AAAAGATAAA AACTTAGATG GTAATGATAT 950 

TCGTGAAGGT TTAACAGCTG TTGTGTCTGT TCGTATTCCA GAAGAATTAT 1000 

TGCAATTTGA AGGACAAACG AAATCTAAAT TGGGTACTTC TGAAGCTAGA 1050 

AGTGCTGTTG ATTCAGTTGT TGCAGACAAA TTGCCATTCT ATTTAGAAGA 1100 

3 0 AAAAGGACAA TTGTCTAAAT CACTTGTGAA MiMiGCGATT 7VAAGCACAAC 1150 

AAGCAAGGGA AGCTGCACGT AAAGCTCGTG AAGATGCTCG TTCAGGTAAG 12 00 

AAAAACAAGC GTAAAGACAC TTTGCTATCT GGTAAATTAA CACCTGCACA 12 50 

AAGTAAAAAC ACTGAAAAAA ATGAATTGTA TTTAGTCGAA GGTGATTCTG 1300 

CGGGAGGTTC AGCAAAACTT GGACGAGACC GCAAATTCCA AGCGATATTA 1350 

35. CCATTACGTG GTAAGGTAAT TAATACAGAG AAAGCACGTC TAGAAGATAT 14 00 

TTTTAAAAAT GAAGAAATTA ATACAATTAT CCACACAATC GGGGCAGGCG 14 50 

TTGGTACTGA CTTTAAAATT GAAGATAGTA ATTATAATCG TGTAATTATT 1500 

ATGACTGATG CTGATACTGA TGGTGCGCAT ATTCAAGTGC TATTGTTAAC 1550 

ATTCTTCTTC AAATATATGA AACCGCTTGT TCAAGCAGGT CGTGTATTTA 1600 

4 0 TTGCTTTACC TCCACTTTAT AAATTGGAAA AAGGTAAAGG CAAAACAAAG 1650 

CGAGTTGAAT ACGCTTGGAC AGACGAAGAG CTTAATAAAT TGCAAAAAGA 17 00 

ACTTGGTAAA GGCTTCACGT TACAACGTTA CAAAGGTTTG GGTGAAATGA 17 50 

ACCCTGAGCA ATTATGGGAA ACGACGATGA ACCCAGAAAC ACGAACTTTA 18 00 

ATTCGTGTAC AAGTTGAAGA TGAAGTGCGT TCATCTAAAC GTGTAACAAC 1850 

45 ATTAATGGGT GACAAAGTAC AACCTAGACG TGAATGGATT GAAAAGCATG 1900 

TTGAGTTTGG TATGCAAGAG GACCAAAGTA TTTTAGATAA TTCTGAAGTA 1950 

CAAGTGCTTG AAAATGATCA ATTTGATGAG GAGGAAATCT AG 1992 



50 

2) INFORMATION FOR SEQ ID NO: 1329 

(i) SEQUENCE CHARACTERISTICS: 

718 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1329 
TGTAGAGCGC GGTATCATCA AAGTA 



2) INFORMATION .FOR SEQ ID NO: 1330 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1330 
AGATTCGAAC TTGGTGTGCG GG 



2 ) INFORMATION FOR SEQ ID NO: 1331 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1331 
GCCCTTGAGG TACAGAATGG TAATGAAGTT 



2) INFORMATION FOR SEQ ID NO: 1332 

(.i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332 
GACCGCGGCG CAGACCATCA 20 

2) INFORMATION FOR SEQ ID NO: 1333 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333 

20 

TCATGGTGAC TTATCTATTT ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1334 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334 

CATCTATTTA TAAAGCAATG GTA 23 



2) INFORMATION FOR SEQ ID NO: 1335 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335 

CTATTTATGG AGCAATGGT 19 
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2) INFORMATION FOR SEQ ID NO: 1336 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1336 
TGGAGACTAC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1337 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337 
TGGAGACTTC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1338 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338 
GTGTACGGAG CAATG 



2) INFORMATION FOR SEQ ID NO: 1339 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339 
CCAGCGGAAA TGCGT 15 



2) INFORMATION FOR SEQ ID NO: 1340 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340 
GAACAAGGTA TGACACCGGA TAAAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1341 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341 
40 GATAACTGAA ATCCTGAGCC ATACG 25 



2) INFORMATION FOR SEQ ID NO: 1342 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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GATGTTATTG GTCAATATCA TCCA 24 



2) INFORMATION FOR SEQ ID NO: 1343 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343 
AAGAAACTGT CTCTTTATTA ATATCACGT 29 

20 

2) INFORMATION FOR SEQ ID NO: 1344 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1344 
35 AGCAGCAACG ATGTTACGCA GCAG 24 



2) INFORMATION FOR SEQ ID NO: 1345 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1345 

50 

CCCGCCGAGC ATTTCAACTA TTG 23 
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2) INFORiyiATION FOR SEQ ID NO: 1346 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1346 
GATGTTACGC AGCAGGGCAG TC 



2) INFORMATION FOR SEQ ID NO: 1347 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1347 
ACCAAGCAGG TTCGCAGTCA ACTA 



2) INFORMATION FOR SEQ ID NO: 1348 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Unidentified bacterium 
(C) ACCESSION NUMBER: X04555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1348 

ATGCGCTCAC GCAACTGGTC CAGAACCTTG ACCGAACGCA GCGGTGGTAA 50 

CGGCGCAGTG GCGGTTTTCA TGGCTTGTTA TGACTGTTTT TTTGTACAGT 100 

CTATGCCTCG GGCATCCAAG CAGCAAGCGC GTTACGCCGT GGGTCGATGT 150 

TTGATGTTAT GGAGCAGCAA CGATGTTACG CAGCAGGGCA GTCGCCCTAA 200 

AACAAAGTTA GGCCGCATGG ACACAACGCA GGTCACATTG ATACACA7\AA 250 
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TTCTAGCTGC GGCAGATGAG 
TGGGCGATCG ATGCACGGCT 
TGATCTGACG TTTCCCGGCG 
AAATGCTCGG CGGGCGCGTC 
5 GAGATCGGGG ATGAGTTACT 
AGCGTATGAA ATCGCGGAGG 
AGGGCGTCAT CGCCGGGCGG 
ATCTGGGATT ACTTTTACTA 
TACAAAGCAC ATAGAGTCCT 
10 AAAAGGTTGA GGTCTTGCGT 



CGAAATCTGC CGCTCTGGAT 
AGGGCGTGTA ACACGCAAGC 
AGAGGCGCGG CGAGCTCGAG 
ATGGAGGAGT TGGACTATGG 
TGACTGCGAA CCTGCTTGGT 
CTCCGCAGGG CTCGTGCCCA 
CCAGTCCGTT GTAACAGCTG 
TGCCGATGAA GTACCACCAG 
ACAGGCTCGC ATGCACCTCA 
GCCGCTTTCA GGTCGCGATA 



PCT/CAOO/01150 

CGGTGGGGGC 300 

ACGATGATAT 350 

GCAATAGTTG 400 

ATTCTTAGCG 450 

GGGCAGACGA 500 

GAGGCGGCTG 550 

GGAGGCGATC 600 

TGGACTGGCC 650 

CTCGGGGCGG 700 

TGCGGCCTAA 750 



2) INFORMATION FOR SEQ ID NO: 1349 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349 

25 

CAGCCGACCA ATGAGTATCT TGCC 24 



30 2 ) INFORMATION FOR SEQ ID NO: 1350 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350 

TAATCAGGGC AGTTGCGACT CCTA 24 



45 

2) INFORMATION FOR SEQ ID NO: 1351 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 531 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
5 (B) STRAIN: Stone 130 

(C) ACCESSION NUMBER: L06157 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351 

10 ATGTTATGGA GCAGCAACGA TGTTACGCAG CAGGGCAGTC GCCCTAAAAC 50 

AAAGTTAGGT GGCTCAATGA GCATCATTGC AACCGTCAAG ATCGGCCCTG 100 

ACGAAATTTC AGCCATGAGG GCTGTGCTCG ATCTCTTCGG CAAAGAGTTT 150 

GAGGACATTC CAACCTACTC TGATCGCCAG CCGACCAATG AGTATCTTGC 200 

CAATCTTCTG CACAGCGAGA CGTTCATCGC GCTCGCTGCT TTTGACCGCG 250 

15 GAACAGCAAT AGGTGGGCTC GCCGCCTACG TTCTACCCAA GTTCGAGCAA 300 

GCGCGAAGCG AGATCTACAT TTATGACTTG GCAGTCGCTT CCAGCCATCG 350 

AAGGCTAGGA GTCGCAACTG CCCTGATTAG CCACCTGAAG CGTGTGGCGG 400 

TTGAACTTGG CGCGTATGTA ATCTATGTGC AAGCAGACTA CGGTGACGAT 450 

CCGGCAGTCG CTCTCTACAC AAAGCTTGGA GTTCGGGAAG ACGTCATGCA 500 

20 CTTCGACATT GATCCAAGAA CCGCCACCTA A 531 



25 



2) INFORMATION FOR SEQ ID NO: 1352 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1352 

35 

CCACGCTGAC AGAGCCGCAC CG 22 



40 2) INFORMATION FOR SEQ ID NO: 1353 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353 

GGCCAGCTCC CATCGGACCC TG 22 
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10 



15 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354 
CACGCTGACA GAGCCGCACC G 21 

2) INFORMATION FOR SEQ ID NO: 1355 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355 
ATGCCGTTGC TGTCGAAATC CTCG 24 

30 

2) INFORMATION FOR SEQ ID NO: 1356 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
45 (C) ACCESSION NUMBER: M97172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356 

ATGAACACGA TCGAATCGAT CACGGCGGAC CTGCACGGAC TGGGCGTCCG 50 

50 GCCCGGCGAC CTGATCATGG TCCATGCATC GCTGAAAGCC GTCGGCCCGG 100 

TCGAGGGAGG TGCGGCCTCG GTGGTGTCGG CCCTTCGCGC CGCGGTCGGG 150 

TCCGCAGGGA CCCTGATGGG TTATGCCTCA TGGGACCGCT CGCCCTATGA 200 

GGAGACGCTG AACGGCGCGC GGATGGACGA AGAACTGCGC CGCCGGTGGC 250 
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CACCCTTCGA TCTGGCCACA TCCGGTACCT ATCCCGGCTT CGGCCTGCTC 300 

AACCGGTTTC TGCTTGAGGC GCCCGACGCA CGGCGCAGCG CGCATCCCGA 350 

CGCCTCCATG GTCGCGGTCG GCCCCCTTGC CGCCACGCTG ACAGAGCCGC 400 

ACCGGCTTGG GCAGGCGCTG GGCGAAGGCT CGCCGCTGGA GCGCTTCGTC 4 50 

5 GGGCATGGCG GAAAGGTCCT GCTTCTGGGA GCGCCGCTCG ACTCCGTCAC 500 

CGTGCTGCAT TACGCCGAGG CCATCGCCCC CATCCCGAAC AAACGCCGCG 550 

TGACCTATGA T^TGCCGATG CTCGGCCCGG ATGGCAGGGT CCGATGGGAG 600 

CTGGCCGAGG ATTTCGACAG CAACGGCATT CTCGATTGCT TCGCGGTCGA 650 

TGGGAAGCCG GATGCCGTCG AGACGATCGC CAAGGCTTAT GTCGAACTGG 700 

10 GCCGGCATCG GGAAGGCATC GTCGGTCGCG CACCCTCCTA TCTGTTTGAA 750 

GCGCAGGATA TCGTCTCGTT CGGCGTCACC TATCTCGAAC AGCATTTCGG 800 

CGCGCCCTGA 810 



15 

2) INFORMATION FOR SEQ ID NO: 1357 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357 
GCCCATCCAT TTGCCTTTGC 20 

30 

2) INFORMATION FOR SEQ ID NO: 1358 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358 
GCGTACCAAC TTGCCATCCT GAAG 24 

45 



2) INFORMATION FOR SEQ ID NO: 1359 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359 

TGCCCCTGCC ACCTCACTC 19 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1360 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360 
CGTACCAACT TGCCATCCTG AAGA 24 

2) INFORMATION FOR SEQ ID NO: 1361 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X01385 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361 



GTGCAATACG AATGGCGAAA 
TGGGGTTACC CCCGGCGGTG 

45 TCCGGCCCCT CGAAGATGGG 
GCGCTGGGTC CGGGAGGGAC 
CGACGAGCCG TTCGATCCTG 
TTGTCTCTGA CACATTCTGG 
CCATTTGCCT TTGCGGCAGC 

50 TCCATTGCCC CTGCCACCTC 
ATGAACTCGA TGGGCAGGTA 
ACGACGCTGC ATCTTGCCGA 
GAGACACTGC ACCATTCTTC 



AGCCGAGCTC ATCGGTCAGC 
TGCTGCTGGT CCACAGCTCC 
CCACTTGGAC TGATCGAGGC 
GCTCGTCATG CCCTCGTGGT 
CCACGTCGCC CGTTACACCG 
CGCCTGCCAA ATGTAAAGCG 
GGGGCCACAG GCAGAGCAGA 
ACTCGCCTGC AAGCCCGGTC 
CTTCTCCTCG GCGTGGGACA 
GTTGATGGCA AAGGTTCCCT 
AGGATGGCAA GTTGGTACGC 

729 



TTCTCAACCT 50 

TTCCGTAGCG 100 

CCTGCGTGCT 150 

CAGGTCTGGA 200 

GACCTTGGAG 250 

CAGCGCCCAT 300 

TCATCTCTGA 350 

GCCCGTGTCC 400 

CGATGCCAAC 4 50 

ATGGGGTGCC 500 

GTCGATTATC 550 
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TCGAGAATGA CCACTGCTGT GAGCGCTTTG CCTTGGCGGA CAGGTGGCTC 600 

AAGGAGAAGA GCCTTCAGAA GGAAGGTCCA GTCGGTCATG CCTTTGCTCG 650 

GTTGATCCGC TCCCGCGACA TTGTGGCGAC AGCCCTGGGT CAACTGGGCC 700 

GAGATCCGTT GATCTTCCTG CATCCGCCAG AGGGCGGGAT GCGAAGAATG 750 

CGATGCCGCT CGCCAGTCGA TTGGCTGAGC TCATGA 786 



2) INFORMATION FOR SEQ ID NO: 1362 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362 

20 

CGCCGCCATC GCCCAAAGCT GG 22 

25 2) INFORMATION FOR SEQ ID NO: 1363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1363 

CGGCATAATG GAGCGCGGTG ACTG 24 



2) INFORMATION FOR SEQ ID NO: 1364 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1364 

TTTCTCGCCC ACGCAGGAAA AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1365 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365 
CATCCTCGAC GAATATGCCG CG 



2) INFORMATION FOR SEQ ID NO: 1366 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 
(C) ACCESSION NUMBER: M88012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366 

CGACGGCACC 50 

CGCGACCTCG 100 

GCGCGCCGTC 150 

TGATCGAGGC 200 

CGCGATTCGC 250 

CATCGCCCAA 300 

CCGGCTTTGG 350 

CGCACGGCCC 400 

GTGGCTGGTG 450 

CGATCGCGCG 500 

GGGCCCGATG 550 

CGAGGGCAAG 600 

AGCGCGTCTG 650 

GAATATGCCG 700 

CTATCTCGCC 750 

CCCGGCTGAT 800 

GAGGCGCGCC 850 

CCGCGACTGA 900 



ATGACTGATC CCCGCAAAAA 
CGCGACGCCC TGGTCCAAAA 
GCGTGCGCTC AGGCGATATG 
GGGCCGCTGG CGGACGGACC 
CGTCGGCCCC ACCGGGAATA 
CCTATGAACA GACGCTGGGT 
AGCTGGCCTG CGTTCGACCC 
CGCGATCAAC GAATTTATCC 
ATCCCGACGC ATCGATGGCG 
GCGCCGCACG AGATGGGCGC 
TTTTCTCGCC CACGCAGGAA 
CAGTCACCGC GCTCCATTAT 
CGCCGCGTCA CTTATTCGAT 
GGTCACCACG TCCGACTGGG 
CGCCCGACGG CCCCGACGCG 
CGCACCAGGG TTGCGCAAGG 
CGACGCGGCC GATATCGTTT 
ACGCCGCGCC AGCGGCGGCA 



CGGCGATTTG CACGAACCCG 
GCGAGCTGGT CCGGCAATTG 
GTGATGCCGC ATGTGTCGTT 
GCAGACACTT GTCGATGCGC 
TTCTCGCCTT CGTCTCGTGG 
CATGATGCGC CGCCCGCCGC 
CGACCATGCG CCCGCCTACC 
GAACCTATCC GGGGTGTCGG 
GCGATCGGGC CCGATGCGGC 
CGCTTATGGC CCCCGCTCGC 
AAATCCTGTC GATCGGCGCC 
GCCGAAGCGG TGGCGCGGAT 
GCCCTTACTG CGCGAAGGCA 
ATTCGAACGG CATCCTCGAC 
GTCGAACGGA TCGCCCGCGA 
CCCGGTCGGC GGCGCGCAAT 
CCTTCGGCAT CGAATGGCTC 
GCGCTGAAGC CGAAACAACG 
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2) INFORMATION FOR SEQ ID NO: 1367 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1367 
CAAATATACT AACAGAAGCG TTCA 



2) INFORMATION FOR SEQ ID NO: 1368 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1368 
AGGATCTTGC CAATACCTTT AT 



2) INFORMATION FOR SEQ ID NO: 1369 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369 
AAACCTTTGT TTCGGTCTGC TAAT 



2) INFORMATION FOR SEQ ID NO: 1370 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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10 



25 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370 
PAGCGATTCC AATAATACCT TGCT 24 

2) INFORMATION FOR SEQ ID NO: 1371 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 558 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CitrobactGr div^ersus 
(C) ACCESSION NUMBER: M18967 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1371 



ATGAATTATC AAATTGTGAA TATTGCGGAA TGCAGCAATT ATCAGTTAGA 50 

AGCAGCAAAT ATACTAACAG AAGCGTTCAA TGATCTTGGT AACAATTCAT 100 

3 0 GGCCAGATAT GACGAGTGCA ACAAAAGAAG TAAAAGAATG TATTGAGAGT 150 

CCAAACCTTT GTTTCGGTCT GCTAATAAAT AACTCCTTAG TTGGCTGGAT 200 

AGGCTTAAGG CCAATGTACA AGGAAACCTG GGAATTGCAT CCATTGGTTG 250 

TCAGACCAGA TTATCAAAAT AAAGGTATTG GCAAGATCCT GCTTAAGGAA 300 

TTAGAAAACA GAGCTAGAGA GCAAGGTATT ATTGGAATCG CTTTAGGAAC 350 

35 AGATGATGAA TACTATAGAA CAAGTCTCTC TTTAATAACT ATAACAGAAG 4 00 

ATAATATATT TGATTCAATA AAAAATATTA AAAATATTAA TAAACATCCA 4 50 

TATGAGTTTT ATCAGAAGAA TGGTTATTAT ATTGTTGGAA TAATTCC/^^ 500 

TGCCAATGGT AAAAACAAAC CAGATATTTG GATGTGGAAA AGTTTAATCA 550 

AAGAGTAA 558 

40 



2) INFORMATION FOR SEQ ID NO: 1372 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372 
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GCTTTCGTTG CCTTTGCCGA GGTC 



2) INFORMATION FOR SEQ ID NO: 1373 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373 
CACCCCTGTT GCTTCGCCCA CTC 



2) INFORMATION FOR SEQ ID NO: 1374 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374 
AGATATTGGC TTCGCCGCAC CACA 



2) INFORMATION FOR SEQ ID NO: 1375 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137 5 
CCCTGTTGCT TCGCCCACTC CTG 
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2) INFORMATION FOR SEQ ID NO: 1376 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
(C) ACCESSION NUMBER: M94066 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376 

ATGATCGTCA TCTGCGACCA CGACAACCTC GACGCCTGGC TGGCGCTGCG 50 

CACCGCGCTG TGGCCCTCCG GCTCGCCTGA AGATCACCGC GCGGAAATGC 100 

GCGAGATATT GGCTTCGCCG CACCACACCG CGTTTATGGC GCGGGGGCTG 150 

GACGGCGCTT TCGTTGCCTT TGCCGAGGTC GCGCTGCGCT ACGATTACGT 200 

CAACGGCTGC GAATCGTCGC CGGTGGCGTT TTTGGAAGGA ATTTATACCG 250 

CCGAACGCGC CCGCCGCCAG GGCTGGGCCG CGCGCCTGAT CGCGCAGGTG 300 

CAGGAGTGGG CGAAGCAACA GGGGTGCAGC GAGCTGGCGT CGGATACCGA 350 

TATCGCCAAT CTGGACTCCC AGCGCCTGCA TGCGGCGCTG GGCTTTGCCG 400 

AAACGGAGCG AGTAGTGTTT TACCGCAAAA CGCTGGGCTG A 441 



2) INFORMATION FOR SEQ ID NO: 1377 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1377 
GCCGTGGGTC GATGTTTGAT GTTA 



2) INFORMATION FOR SEQ ID NO: 1378 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1378 
GCTCGATGAC GCCAACTACC TCTG 



2) INFORMATION FOR SEQ ID NO: 1379 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1379 
AGCAGCAACG ATGTTACGCA GCAG 



2) INFORMATION FOR SEQ ID NO: 1380 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1380 
CGCTCGATGA CGCCAACTAC CTCT 



2) INFORMATION FOR SEQ ID NO: 1381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X02340 
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(xi) SEQUENCE DESCRIPTION: SEQ 

GTGGTAACGG CGCAGTGGCG GTTTTCATGG 
TTTTGGGGTA CAGTCTATGC CTCGGGCATC 
CCGTGGGTCG ATGTTTGATG TTATGGAGCA 
GGCAGTCGCC CTAAAACAAA GTTAAACATC 
CGAAGTATCG ACTCAACTAT CAGAGGTAGT 
TCGAACCGAC GTTGCTGGCC GTACATTTGT 
GGCCTGAAGC CACACAGTGA TATTGATTTG 
GCTTGATGAA ACAACGCGGC GAGCTTTGAT 
CGGCTTCCCC TGGAGAGAGC GAGATTCTCC 
GTTGTGCACG ACGACATCAT TCCGTGGCGT 
GCAATTTGGA GAATGGCAGC GCAATGACAT 
CAGCCACGAT CGACATTGAT CTGGCTATCT 
CATAGCGTTG CCTTGGTAGG TCCAGCGGCG 
TCCTGAACAG GATCTATTTG AGGCGCTAAA 
ACTCGCCGCC CGACTGGGCT GGCGATGAGC 
TCCCGCATTT GGTACAGCGC AGTAACCGGC 
CGCTGCCGAC TGGGCAATGG AGCGCCTGCC 
TACTTGAAGC TAGACAGGCT TATCTTGGAC 
TCGCGCGCAG ATCAGTTGGA AGAATTTGTC 
CACCAAGGTA GTCGGCAAAT AA 



PCT/CAOO/01150 

ID NO: 1381 

CTTCTTGTTA TGACATGTTT 50 

CAAGCAGCAA GCGCGTTACG 100 

GCAACGATGT TACGCAGCAG 150 

ATGAGGGAAG CGGTGATCGC 200 

TGGCGTCATC GAGCGCCATC 250 

ACGGCTCCGC AGTGGATGGC 300 

CTGGTTACGG TGACCGTAAG 350 

CAACGACCTT TTGGAAACTT 400 

GCGCTGTAGA AGTCACCATT 450 

TATCCAGCTA AGCGCGT^CT 500 

TCTTGCAGGT ATCTTCGAGC 550 

TGCTGACAAA AGCAAGAGAA 600 

GAGGAACTCT TTGATCCGGT 650 

TGAAACCTTA ACGCTATGGA 700 

GAAATGTAGT GCTTACGTTG 750 

AAAATCGCGC CGAAGGATGT 800 

GGCCCAGTAT CAGCCCGTCA 850 

AAGAAGAAGA TCGCTTGGCC 900 

CACTACGTGA AAGGCGAGAT 950 

972 



2) INFORiyiATION FOR SEQ ID NO: 1382 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1382 
TAGATATGAT AGGCGGTAAA AAGC 



2) INFORMATION FOR SEQ ID NO: 1383 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1383 
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CCCAAATTCG AGTAAGAGGT ATT 
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23 



5 2) INFORMATION FOR SEQ ID NO: 1384 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1384 

GATATGATAG GCGGTAAAAA GC 22 



20 



30 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1385 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1385 
TCCCAAATTC GAGTAAGAGG TA 22 

2) INFORMATION FOR SEQ ID NO: 1386 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01282 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1386 

ATGAAAGAAA GATATGGAAC AGTATATAAA GGCTCTCAGA GGCTCATAGA 50 
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CGAGGAAAGT GGAGAAGTAA TAGAGGTAGA 
CGTCTGGTAA CTTTGTAAAA GCGTATATCG 
GATATGATAG GCGGTAAAAA GCTCAAGATT 
TGTACATCTA AGTAATAACA CAATGATAGC 
5 AAGGAACAAA TACAAGCACG AAAACCGTAA 
GAAGAAGGAA ATATCATTAA AAGAAGAACT 
AGAGCTACTC ATGAGAGGCG ATGACCAAAA 
AATTTGGGAA CTTTGAGCAA GAGGACGACC 
TCAGAATATT ATTCTTTCAA GGAGTAG 



PCT/CAOO/01150 

TAAGCTATAC CGTAAACAAA 100 

TCCAATTAAT AAGTATGTTA 150 

GTTAATTATA TATTAGATAA 200 

AACTGTTAGA GAAATAGCAG 250 

ATACAACGCT TAAAATCTTA 300 

GGAGCATTAA TGCTAAACCC 350 

ACAAAAATAC CTCTTACTCG 400 

AAAAGCAAGA AAATGCTTTA 450 

477 



2) INFORMATION FOR SEQ ID NO: 1387 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1387 
25 TTATGCCTCT TCCGACCATC AAGC 24 



30 



2) INFORMATION FOR SEQ ID NO: 1388 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1388 

40 

TACGCTCGTC ATCAAAATCA CTCG 24 



45 2) INFORMATION FOR SEQ ID NO: 1389 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1389 
GAATAACGGT TTGGTTGATG CGAG 24 

5 

2) INFORMATION FOR SEQ ID NO: 1390 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1390 
20 ATGGCAAGAT CCTGGTATCG GTCT 24 



25 



2) INFORMATION FOR SEQ ID NO: 1391 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: J01839 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1391 

40 ATGAGCCATA TTCAACGGGA AACGTCTTGC TCGAGGCCGC GATTAAATTC 50 

CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC GATAATGTCG 100 

GGCAATCAGG TGCGACAATC TATCGATTGT ATGGGAAGCC CGATGCGCCA 150 

GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA 200 

TGAGATGGTC AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA 250 

45 TCAAGCATTT TATCCGTACT CCTGATGATG CATGGTTACT CACCACTGCG 300 

ATCCCCGGGA AAACAGCATT CCAGGTATTA GAAGAATATC CTGATTCAGG 350 

TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG TTGCATTCGA 400 

TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 450 

CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA 500 

50 TGACGAGCGT T^TGGCTGGC CTGTTG7\ACA AGTCTGGAAA GAAATGCATA 550 

AGCTTTTGCC ATTCTCACCG GATTCAGTCG TCACTCATGG TGATTTCTCA 600 

CTTGATAACC TTATTTTTGA CGAGGGGAAA TTAATAGGTT GTATTGATGT 650 

TGGACGAGTC GGT^ATCGCAG ACCGATACCA GGATCTTGCC ATCCTATGGA 700 
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ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 750 
TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT 800 
CGATGAGTTT TTCTAA 816 



2) INFORMATION FOR SEQ ID NO: 1392 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1392 
TGGGTGGAGA GGCTATTCGG CTAT 24 

20 

2) INFORMATION FOR SEQ ID NO: 1393 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1393 
35 CAGTCCCTTC CCGCTTCAGT GAG 23 



40 



2) INFORMATION FOR SEQ ID NO: 1394 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1394 

50 

GACGTTGTCA CTGAAGCGGG AAGG 24 
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2) INFORMATION FOR SEQ ID NO: 1395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1395 
CTTGGTGGTC GAATGGGCAG GTAG 



2) INFORMATION FOR SEQ ID NO: 1396 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00618 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1396 

ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 50 

GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC TGCTCTGATG 100 

CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG 150 

ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG CAGCGCGGCT 200 

ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG 250 

TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG 300 

GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC 350 

TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC TGCCCATTCG 4 00 

ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC 4 50 

GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG GGCTCGCGCC 500 

AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC GGCGAGGATC 550 

TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT GGTGGAAAAT 600 

GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG 650 

CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA GAGCTTGGCG 700 

GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT 7 50 

TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGA 7 95 



2) INFORMATION FOR SEQ ID NO: 1397 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1397 
GTGGGAGAAA ATGAAAACCT AT 



2) INFORMATION FOR SEQ ID NO: 1398 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1398 
ATGGAGTGAA AGAGCCTGAT 



2) INFORMATION FOR SEQ ID NO: 1399 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1399 
ACCTATGATG TGGAACGGGA AAAG 



2) INFORMATION FOR SEQ ID NO: 1400 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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5 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1400 
CGATGGAGTG AAAGAGCCTG ATG 23 



10 2) INFORMATION FOR SEQ ID NO: 1401 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(C) ACCESSION NUMBER: V01547 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1401 



ATGGCTAAAA TGAGAATATC ACCGGAATTG AAAAAACTGA TCGAAAAATA 50 

CCGCTGCGTA AAAGATACGG AAGGAATGTC TCCTGCTAAG GTATATAAGC 100 

TGGTGGGAGA AAATGAAAAC CTATATTTAA AAATGACGGA CAGCCGGTAT 150 

AAAGGGACCA CCTATGATGT GGAACGGGAA AAGGACATGA TGCTATGGCT 200 

30 GGAAGGAAAG CTGCCTGTTC CAAAGGTCCT GCACTTTGAA CGGCATGATG 250 

GCTGGAGCAA TCTGCTCATG AGTGAGGCCG ATGGCGTCCT TTGCTCGGAA 300 

GAGTATGAAG ATGAACAAAG CCCTGAAAAG ATTATCGAGC TGTATGCGGA 350 

GTGCATCAGG CTCTTTCACT CCATCGACAT ATCGGATTGT CCCTATACGA 4 00 

ATAGCTTAGA CAGCCGCTTA GCCGAATTGG ATTACTTACT GAATAACGAT 4 50 

35 CTGGCCGATG TGGATTGCGA AAACTGGGAA GAAGACACTC CATTTAAAGA 500 

TCCGCGCGAG CTGTATGATT TTTTAAAGAC GGAAAAGCCC GAAGAGGAAC 550 

TTGTCTTTTC CCACGGCGAC CTGGGAGACA GCAACATCTT TGTGAAAGAT 600 

GGCAAAGTAA GTGGCTTTAT TGATCTTGGG AGAAGCGGCA GGGCGGACAA 650 

GTGGTATGAC ATTGCCTTCT GCGTCCGGTC GATCAGGGAG GATATCGGGG 700 

40 AAG7VACAGTA TGTCGAGCTA TTTTTTGACT TACTGGGGAT CAAGCCTGAT 750 

TGGGAGAAAA TAA7\ATATTA TATTTTACTG GATGAATTGT TTTAG 7 95 



45 2) INFORMATION FOR SEQ ID NO: 1402 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1402 
TATTCAACAA TTTATCGGAA ACAG 



2) INFORMATION FOR SEQ ID NO: 1403 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1403 

TCAGAGAGCC AACTCAACAT TTT 



2) INFORMATION FOR SEQ ID NO: 1404 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
{B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1404 
AAACAGCGTT TTAGAGCCAA ATAA 



2) INFORMATION FOR SEQ ID NO: 1405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1405 
TTCTCAGAGA GCCAACTCAA CATT 
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15 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetoJbacter baumannii 

(B) STRAIN: BM2580 

(C) ACCESSION NUMBER: X07753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1406 



ATGGAATTGC CCAATATTAT TCAACAATTT ATCGGAAACA GCGTTTTAGA 50 

GCCAAATAAA ATTGGTCAGT CGCCATCGGA TGTTTATTCT TTTAATCGAA 100 

20 ATAATGAAAC TTTTTTTCTT AAGCGATCTA GCACTTTATA TACAGAGACC 150 

ACATACAGTG TCTCTCGTGA AGCGAAAATG TTGAGTTGGC TCTCTGAGAA 200 

ATTAAAGGTG CCTGAACTCA TCATGACTTT TCAGGATGAG CAGTTTGAAT 250 

TCATGATCAC TAAAGCGATC AATGCAAAAC CAATTTCAGC GCTTTTTTTA 300 

ACAGACCAAG AATTGCTTGC TATCTATAAG GAGGCACTCA ATCTGTTAAA 350 

25 TTCAATTGCT ATTATTGATT GTCCATTTAT TTCAAACATT GATCATCGGT 400 

TAAAAGAGTC AAAATTTTTT ATTGATAACC AACTCCTTCSA CGATATAGAT 450 

CAAGATGATT TTGACACTGA ATTATGGGGA GACCATAAAA CTTACCTAAG 500 

TCTATGGAAT GAGTTAACCG AGACTCGTGT TGAAGAAAGA TTGGTTTTTT 550 

CTCATGGCGA TATCACGGAT AGTAATATTT TTATAGATAA ATTCAATGAA 600 

3 0 ATTTATTTTT TAGATCTTGG TCGTGCTGGG TTAGCAGATG AATTTGTAGA 650 

TATATCCTTT GTTGAACGTT GCCTAAGAGA GGATGCATCG GAGGAAACTG 700 

CGAAAATATT TTTAAAGCAT TTAAAAAATG ATAGACCTGA CAAAAGGAAT 7 50 

TATTTTTTAA AACTTGATGA ATTGAATTGA 7 80 



2) INFORMATION FOR SEQ ID NO: 1407 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1407 
CCCTGTAATA GAAAAGCAAG TAGG 24 

50 



2) INFORMATION FOR SEQ ID NO: 1408 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STFIANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1408 
TTGTCGTATC CCTCAAATCA CC 



2) INFORMATION FOR SEQ ID NO: 1409 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1409 
TGGGATTACA ATGGCAATCA GCG 



2) INFORMATION FOR SEQ ID NO: 1410 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1410 
GGGGAATAGG TCACAAGATC TGCTT 



2) INFORMATION FOR SEQ ID NO: 1411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: S46063 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1411 



ATGCTTTTAT ATAAAATGTG TGACAATCAA AATTATGGGG TTACTTACAT 50 

GAAGTTTTTA TTGGCATTTT CGCTTTTAAT ACCATCCGTG GTTTTTGCAA 100 

GTAGTTCAAA GTTTCAGCAA GTTGAACAAG ACGTTAAGGC AATTGAAGTT 150 

TCTCTTTCTG CTCGTATAGG TGTTTCCGTT CTTGATACTC AATVATGGAGA 200 

15 ATATTGGGAT TACAATGGCA ATCAGCGCTT CCCGTTAACA AGTACTTTTA 250 

AAACAATAGC TTGCGCTAAA TTACTATATG ATGCTGAGCA AGGAAAAGTT 300 

AATCCCAATA GTACAGTCGA GATTAAGAAA GCAGATCTTG TGACCTATTC 350 

CCCTGTAATA GAAAAGCAAG TAGGGCAGGC AATCACACTC GATGATGCGT 400 

GCTTCGCAAC TATGACTACA AGTGATTU^TA CTGCGGCAAA TATCATCCTA 4 50 

20 AGTGCTGTAG GTGGCCCCAA AGGCGTTACT GATTTTTTAA GACAAATTGG 500 

GGACAAAGAG ACTCGTCTAG ACCGTATTGA GCCTGATTTA AATGAAGGTA 550 

AGCTCGGTGA TTTGAGGGAT ACGACAACTC CTAAGGCAAT AGCCAGTACT 600 

TTGAATAAAC TTTTATTTGG TTCCGCGCTA TCTGAAATGA ACCAGAAAAA 650 

ATTAGAGTCT TGGATGGTGA ACAATCAAGT CACTGGTAAT TTACTACGTT 700 

25 CAGTATTGCC GGCGGGATGG AACATTGCGG ATCGCTCAGG TGCTGGCGGA 7 50 

TTTGGTGCTC GGAGTATTAC AGCAGTTGTG TGGAGTGAGC ATCAAGCCCC 800 

AATTATTGTG AGCATCTATC TAGCTCAAAC ACAGGCTTCA ATGGCAGAGC 850 

GAAATGATGC GATTGTTAAA ATTGGTCATT CAATTTTTGA CGTTTATACA 900 

TCACAGTCGC GC 912 

30 



2) INFORMATION FOR SEQ ID NO: 1412 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1412 
45 GAGAAAACGC TCCAGCAGGG C 21 



2) INFORMATION FOR SEQ ID NO: 1413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1413 
CATGAGGCTT TCACTGCGGG G 21 



2) INFORMATION FOR SEQ ID NO: 1414 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1414 
TATCGTTAAT CGCACCATCA C 21 

25 

2) INFORMATION FOR SEQ ID NO: 1415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1415 
40 ATGCAGTAAT GCGGCTTTAT C 21 



2) INFORMATION FOR SEQ ID NO: 1416 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: HEL-1 

(C) ACCESSION NUMBER: X91840 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1416 

ATGATGAAAA AATCGTTATG CTGCGCTCTG CTGCTGACAG CCTCTTTCTC 50 

CACATTTGCT GCCGCAAAAA CAGAACAACA GATTGCCGAT ATCGTTAATC 100 

10 GCACCATCAC CCCGTTGATG CAGGAGCAGG CTATTCCGGG TATGGCCGTT 150 

GCCGTTATCT ACCAGGGAAA ACCCTATTAT TTCACCTGGG GTAAAGCCGA 200 

TATCGCCAAT AACCACCCAG TCACGCAGCA AACGCTGTTT GAGCTAGGAT 250 

CGGTTAGTAA GACGTTTAAC GGCGTGTTGG GCGGCGATGC TATCGCCCGC 300 

GGCGAAATTA AGCTCAGCGA TCCGGTCACG AAATACTGGC CAGAACTGAC 350 

15 AGGCAAACAG TGGCAGGGTA TCCGCCTGCT GCACTTAGCC ACCTATACGG 400 

CAGGCGGCCT ACCGCTGCAG ATCCCCGATG ACGTTAGGGA TAAAGCCGCA 4 50 

TTACTGCATT TTTATCAAAA CTGGCAGCCG CAATGGACTC CGGGCGCTAA 500 

GCGACTTTAC GCTAACTCCA GCATTGGTCT GTTTGGCGCG CTGGCGGTGA 550 

AACCCTCAGG AATGAGTTAC GAAGAGGCAA TGACCAGACG CGTCCTGCAA 600 

20 CCATTAAAAC TGGCGCATAC CTGGATTACG GTTCCGCAGA ACGAACAAAA 650 

AGATTATGCC TGGGGCTATC GCGAAGGGAA GCCCGTACAC GTTTCTCCGG 700 

GACAACTTGA CGCCGAAGCC TATGGCGTGA AATCCAGCGT TATTGATATG 750 

GCCCGCTGGG TTCAGGCCAA CATGGATGCC AGCCACGTTC AGGAGAAAAC 800 

GCTCCAGCAG GGCATTGCGC TTGCGCAGTC TCGCTACTGG CGTATTGGCG 850 

25 ATATGTACCA GGGATTAGGC TGGGAGATGC TGAACTGGCC GCTGAAAGCT 900 

GATTCGATCA TCAACGGCAG CGACAGCAAA GTGGCATTGG CAGCGCTTCC 950 

CGCCGTTGAG GTAAACCCGC CCGCCCCCGC AGTGAAAGCC TCATGGGTGC 1000 

ATAAAACGGG CTCCACTGGT GGATTTGGCA GCTACGTAGC CTTCGTTCCA 1050 

GAAAAAAACC TTGGCATCGT GATGCTGGCA AACAAAAGCT ATCCTAACCC 1100 

30 TGTCCGTGTC GAGGCGGCCT GGCGCATTCT TGAAAAGCTG CAATAA 114 6 



2) INFORMATION FOR SEQ ID NO: 1417 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1417 

45 

TGGTTAACTA YAATCCSATT GCGGA 25 



50 2) INFORMATION FOR SEQ ID NO: 1418 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1418 
ATGCTTTACC CAGCGTCAGA TT 



2) INFORMATION FOR SEQ ID NO: 1419 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1419 
CGATGAATAA GCTGATTTCT CACG 



2) INFORMATION FOR SEQ ID NO: 1420 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1420 
TGCTTTACCC AGCGTCAGAT TACG 



2) INFOimATION FOR SEQ ID NO: 1421 

(i) SEQUENCE CHAE?ACTERISTICS : 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1421 
AATTAGAGCG GCAGTCGGGA GGAA 24 

5 

2) INFORMATION FOR SEQ ID NO: 1422 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1422 
20 GAAATCAGCT TATTCATCGC CACG 24 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1423 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(B) STRAIN: GRI-1 

(C) ACCESSION NUMBER: X92506 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1423 



ATGGTTAAAA AATCACTGCG TCAGTTCACG CTGATGGCGA CGGCAACCGT 50 

CACGCTGTTG TTAGGAAGTG TGCCGCTGTA TGCGCAAACG GCGGACGTAC 100 

AGCAAA7y\CT TGCCGAATTA GAGCGGCAGT CGGGAGGAAG ACTGGGTGTG 150 

GCATTGATTA ACACAGCAGA TT^TTCGCAA ATACTTTATC GTGCTGATGA 2 00 

45 GCGCTTTGCG ATGTGCAGCA CCAGTAAAGT GATGGCCGTG GCCGCGGTGC 250 

TGAAGAAAAG TGAAAGCGAA CCGAATCTGT TAAATCAGCG AGTTGAGATC 300 

AAAAAATCTG ACTTGGTTAA CTATAATCCG ATTGCGGAAA AGCACGTCGA 350 

TGGGACGATG TCACTGGCTG AGCTTAGCGC GGCCGCGCTA CAGTACAGCG 400 

ATAACGTGGC GATGAATAAG CTGATTTCTC ACGTTGGCGG CCCGGCTAGC 4 50 

50 GTCACCGCGT TCGCCCGACA GCTGGGAGAC GAAACGTTCC GTCTCGACCG 500 

TACCGAGCCG ACGTTAAACA CCGCCATTCC GGGCGATCCG CGTGATAGCA 550 

CTTCACCTCG GGCAATGGCG CAAACTCTGC GTAATCTGAC GCTGGGTAAA 600 

GCATTGGGTG ACAGCCAACG GGCGCAGCTG GTGACATGGA TGAAAGGCAA 650 
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TACCACCGGT GCAGCGAGCA TTCAGGCTGG ACTGCCTGCT TCCTGGGTTG 700 

TGGGGGATAA AACCGGCAGC GGTGACTATG GCACCACCAA CGATATCGCG 750 

GTGATCTGGC CAAAAGATCG TGCGCCGCTG ATTCTGGTCA CTTACTTCAC 800 

CCAGCCTCAA CCTAAGGCAG AAAGCCGTCG CGATGTATTA GCGTCGGCGG 850 

CTAAAATCGT CACCAACGGT TTGTAA 876 



2) INFORMATION FOR SEQ ID NO: 1424 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1424 

20 

GTTAACGGTG ATGGCGACGC TAG 23 



25 2) INFORMATION FOR SEQ ID NO: 1425 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1425 

GAATTATCGG CGGTGTTAAT CAGC 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1426 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1426 

CACGCTCAAT ACCGCCATTC CA 22 
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2) INFORMATION FOR SEQ ID NO: 1427 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1427 
TTATCGCCCA CTACCCATGA TTTC 24 



2) INFORMATION FOR SEQ ID NO: 1428 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: CAS-5 

30 (C) ACCESSION NUMBER: X92507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1428 

ATGATGACTC AGAGCATTCG CCGCTCAATG TTAACGGTGA TGGCGACGCT 50 

35 ACCCCTGCTA TTTAGCAGCG CAACGCTGCA TGCGCAGGCG " AACAGCGTGC 100 

AACAGCAGCT GGAAGCCCTG GAGAAAAGTT CGGGAGGTCG GCTTGGCGTT 150 

GCGCTGATTA ACACCGCCGA TAATTCGCAG ATTCTCTACC GTGCCGATGA 200 

ACGTTTTGCG ATGTGCAGTA CCAGTAAGGT GATGGCGGCC GCGGCGGTGC 250 

TTAAACAGAG CGAGAGCGAT AAGCACCTGC TAAATCAGCG CGTTGAAATC 300 

40 AAGAAGAGCG ACCTGGTTAA CTACAATCCC ATTGCGGAGA AACACGTTAA 350 

CGGCACGATG ACGCTGGCTG AGCTTGGCGC AGCGGCGCTG CAGTATAGCG 400 

ACAATACTGC CATGAATAAG CTGATTGCCC ATCTGGGTGG TCCCGATAAA 4 50 

GTGACGGCGT TTGCTCGCTC GTTGGGTGAT GAGACCTTCC GTCTGGACAG 500 

AACCGAGCCC ACGCTCAATA CCGCCATTCC AGGCGACCCG CGTGATACCA 550 

45 CCACGCCGCT CGCGATGGCG CAGACCCTGA AAAATCTGAC GCTGGGTAAA 600 

GCGCTGGCGG AAACTCAGCG GGCACAGTTG GTGACGTGGC TTAAGGGCAA 650 

TACTACCGGT AGCGCGAGCA TTCGGGCGGG TCTGCCGAAA TCATGGGTAG 700 

TGGGCGATAA AACCGGCAGC GGAGATTATG GCACCACCAA CGATATCGCG 750 

GTTATCTGGC CGGAAAACCA CGCACCGCTG GTTCTGGTGA CCTACTTTAC 800 

50 CCAACCGGAG CAGAAGGCGG AAAGCCGTCG GGATATTCTG GCTGCGGCGG 850 

CGAAAATCGT AACCCACGGT TTCTGA 876 
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2) INFORMATION FOR SEQ ID NO: 1429 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1429 
TTTACGGCTA AAGATACTGA AAAGT 25 



2) INFORMATION FOR SEQ ID NO: 1430 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1430 
GTTTAATAAA ACAACCACCG AATAAT 26 

30. 

2) INFORMATION FOR SEQ ID NO: .1431 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1431 
45 TAATTGACAC TCCATTTACG GCTAA 25 



2) INFORMATION FOR SEQ ID NO: 1432 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1432 



ACCGAATAAT ATTTTCCTTT CAGGCA 



10 



2) INFORMATION FOR SEQ ID NO: 1433 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 bases 

(B) TYPE: Nucleic acid 

(C) STEIANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) 



MOLECULE T.Y.PE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: AJ223604 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1433 

ATGAGCAAGT TATCTGTATT CTTTATATTT TTGTTTTGCA GCATTGCTAC 
CGCAGCAGAG TCTTTGCCAG ATTTAAT^AAT TGAAAAGCTT GATGAAGGCG 

30 TTTATGTTCA TACTTCGTTT GAAGAAGTTA ACGGGTGGGG CGTTGTTCCT 
AAACATGGTT TGGTGGTTCT TGTAAATGCT GAGGCTTACC TAATTGACAC 
TCCATTTACG GCTAAAGATA CTGAAAAGTT AGTCACTTGG TTTGTGGAGC 
GTGGCTATAA AATAAAAGGC AGCATTTCCT CTCATTTTCA TAGCGACAGC 
ACGGGCGGAA TAGAGTGGCT TAATTCTCGA TCTATCCCCA CGTATGCATC 

35 TGAATTAACA AATGAACTGC TTAAAAAAGA CGGTAAGGTT CAAGCCACAA 
ATTCATTTAG CGGAGTTAAC TATTGGCTAG TTAAAAATAA AATTGAAGTT 
TTTTATCCAG GCCCGGGACA CACTCCAGAT AACGTAGTGG TTTGGTTGCC 
TGAAAGGAAA ATATTATTCG GTGGTTGTTT TATTAAACCG TACGGTTTAG 
GCAATTTGGG TGACGCAAAT ATAGAAGCTT GGCCAAAGTC CGCCAAATTA 

40 TTAAAGTCCA AATATGGTAA GGCAAAACTG GTTGTTCCAA GTCACAGTGA 
AGTTGGAGAC GCATCACTCT TGAAACTTAC ATTAGAGCAG GCGGTTAAAG 
GGTTAAACGA AAGTAAAAAA CCATCAAAAC CAAGCAACTA A 



45 

2) INFORMATION FOR SEQ ID NO: 1434 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
741 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1434 
CACAATCAAG ACCAAGATTT GCGAT 



2) INFORMATION FOR SEQ ID NO: 1435 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1435 
GAAAGGGCAG CTCGTTACGA TAGAG 



2) INFORMATION FOR SEQ ID NO: 1436 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1436 
CAGCATCAAC ATTTAAGATC CCCA 



2) INFORMATION FOR SEQ ID NO: 1437 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1437 
CTCCACTTGA TTAACTGCGG AAATTC 
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2) INFORMATION FOR SEQ ID NO: 1438 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X06046 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1438 

ATGGCAATCC GAATCTTCGC GATACTTTTC TCCATTTTTT CTCTTGCCAC 50 

TTTCGCGCAT GCGCAAGAAG GCACGCTAGA ACGTTCTGAC TGGAGGAAGT 100 

TTTTCAGCGA ATTTCAAGCC AAAGGCACGA TAGTTGTGGC AGACGAACGC 150 

CAAGCGGATC GTGCCATGTT GGTTTTTGAT CCTGTGCGAT CGAAGAAACG 200 

CTACTCGCCT GCATCGACAT TCAAGATACC TCATACACTT TTTGCACTTG 250 

ATGCAGGCGC TGTTCGTGAT GAGTTCCAGA TTTTTCGATG GGACGGCGTT 300 

AACAGGGGCT TTGCAGGCCA CAATCAAGAC CAAGATTTGC GATCAGCAAT 350 

GCGGAATTCT ACTGTTTGGG TGTATGAGCT ATTTGCAAAG GAAATTGGTG 400 

ATGACAAAGC TCGGCGCTAT TTGAAGAAAA TCGACTATGG CAACGCCGAT 4 50 

CCTTCGACAA GTAATGGCGA TTACTGGATA GAAGGCAGCC TTGCAATCTC 500 

GGCGCAGGAG CAAATTGCAT TTCTCAGGAA GCTCTATCGT AACGAGCTGC 550 

CCTTTCGGGT AGAACATCAG CGCTTGGTCA AGGATCTCAT GATTGTGGAA 600 

GCCGGTCGCA ACTGGATACT GCGTGCAAAG ACGGGCTGGG AAGGCCGTAT 650 

GGGTTGGTGG GTAGGATGGG TTGAGTGGCC GACTGGCTCC GTATTCTTCG 7 00 

CACTGAATAT TGATACGCCA AACAGAATGG ATGATCTTTT CAAGAGGGAG 750 

GCAATCGTGC GGGCAATCCT TCGCTCTATT GAAGCGTTAC CGCCCAACCC 800 

GGCAGTCAAC TCGGACGCTG CGCGATAA 828 



2) INFORMATION FOR SEQ ID NO: 1439 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: J03427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1439 

ATGAT^AACAT TTGCCGCATA TGTAATTATC GCGTGTCTTT CGAGTACGGC 50 

758 
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ATTAGCTGGT TCAATTACAG AAAATACGTC TTGGAACAAA GAGTTCTCTG 100 

CCGAAGCCGT CAATGGTGTC TTCGTGCTTT GTAAAAGTAG CAGTAAATCC 150 

TGCGCTACCA ATGACTTAGC TCGTGCATCA AAGGAATATC TTCCAGCATC 200 

AACATTTAAG ATCCCCAACG CAATTATCGG CCTAGAAACT GGTGTCATAA 250 

5 AGAATGAGCA TCAGGTTTTC AAATGGGACG GAAAGCCAAG AGCCATGAAG 300 

CAATGGGAAA GAGACTTGAC CTTAAGAGGG GCAATACAAG TTTCAGCTGT 350 

TCCCGTATTT CAACAAATCG CCAGAGAAGT TGGCGAAGTA AGAATGCAGA 400 

AATACCTTAA AAAATTTTCC TATGGCAACC AGAATATCAG TGGTGGCATT 450 

GACAAATTCT GGTTGGAAGG CCAGCTTAGA ATTTCCGCAG TTAATCAAGT 500 

10 GGAGTTTCTA GAGTCTCTAT ATTTAAATAA ATTGTCAGCA TCTAAAGAAA 550 

ACCAGCTAAT AGTAAAAGAG GCTTTGGTAA CGGAGGCGGC ACCTG7\ATAT 600 

CTAGTGCATT CAAAAACTGG TTTTTCTGGT GTGGGAACTG AGTCAAATCC 650 

TGGTGTCGCA TGGTGGGTTG GGTGGGTTGA GAAGGAGACA GAGGTTTACT 700 

TTTTCGCCTT TAACATGGAT ATAGACAACG AAAGTAAGTT GCCGCTAAGA 750 

15 AAATCCATTC CCACCAAAAT CATGGAAAGT GAGGGCATCA TTGGTGGCTA 800 

A 801 



2 0 2) INFORMATION FOR SEQ ID NO: 14 40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1440 

AGACCGTTAT CGTAAACAGG GCTAAG 26 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1441 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1441 
TTTTTTGCTC AAACTTTTTC AGGATC 26 

2) INFORMATION FOR SEQ ID NO: 1442 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: RNL-1 
. (C) ACCESSION NUMBER: Z21957 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1442 



ATGAATGTCA TTATAAAAGC TGTAGTTACT GCCTCGACGC TACTGATGGT 50 

ATCTTTTAGT TCATTCGAAA CCTCAGCGCA ATCCCCACTG TTAAAAGAGC 100 

AAATTGAATC CATAGTCATT GGAAAAAAAG CCACTGTAGG CGTTGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTACTG ATTAATCCTT TTGAAAAATT 200 

20 CCCAATGCAA AGTGTATTTA AATTGCATTT AGCTATGTTG GTACTGCATC 250 

AGGTTGATCA GGGAAAGTTG GATTTAAATC AGACCGTTAT CGTAAACAGG 300 

GCTAAGGTTT TACAGAATAC CTGGGCTCCG ATAATGAAAG CGTATCAGGG 350 

AGACGAGTTT AGTGTTCCAG TGCAGCAACT GCTGCAATAC TCGGTCTCGC 400 

ACAGCGATAA CGTGGCCTGT GATTTGTTAT TTGAACTGGT TGGTGGACCA 4 50 

25* GCTGCTTTGC ATGACTATAT CCAGTCTATG GGTATT^AAGG AGACCGCTGT 500 

GGTCGCAAAT GAAGCGCAGA TGCACGCCGA TGATCAGGTG CAGTATCAAA 550 

ACTGGACCTC GATGAAAGGT GCTGCAGAGA TCCTGAAAAA GTTTGAGCAA 600 

AAAACACAGC TGTCTGAAAC CTCGCAGGCT TTGTTATGGA AGTGGATGGT 650 

CGAAACCACC ACAGGACCAG AGCGGTTAAA AGGTTTGTTA CCAGCTGGTA 700 

30 CTGTGGTCGC ACATAAAACT GGTACTTCGG GTATCAAAGC CGGAAAAACT 750 

GCGGCCACTA ATGATTTAGG TATCATTCTG TTGCCTGATG GACGGCCCTT 800 

GCTGGTTGCT GTTTTTGTGA AAGACTCAGC CGAGTCAAGC CGAACCAATG 850 

AAGCTATCAT TGCGCAGGTT GCTCAGACTG CGTATCAATT TGAATTGAAA 900 

AAGCTTTCTG CCCTAAGCCC AAATTAA 927 

35 



2) INFORMATION FOR SEQ ID NO: 1443 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1443 
50 CTTCTGCTCT GCTGATGCTT GGC 23 
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2) INFORMATION FOR SEQ ID NO: 1444 



PCT/CAOO/01150 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1444 
GGCGACCAGG TATTTTGTAA TACTGC 26 

2) INFORMATION FOR SEQ ID NO: 1445 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: JMC 

30 (C) ACCESSION NUMBER: X93314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1445 

ATGAATGTCA TCACAT^yVTC TGTTTTCACC GCTTCTGCTC TGCTGATGCT 50 

35 TGGCTTAAGT TCATTTGTAG TATCAGCCCA ATCCCCTTTG TTAAAAGAGC 100 

AGATTGAAAC CATAGTGACG GGTAAAAAGG CCACTGTAGG TGTAGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTGTTG CTGAATCCAT TTGAAAAGTT 200 

TCCGATGCAA AGTGTGTTTA AACTGCATTT AGCTATGTTA GTTCTGCATC 250 

AGGTCGATCA GGGGAAACTG GATTTAAATC AGTCTGTTAC TGTTAATCGT 300 

40 GGTGCAGTAT TACAAAATAC CTGGTCGCCA ATGATGAAAG ATCATCAGGG 350 

CGATGAATTT ACTGTTGCAG TACAGCAGTT ACTGCAGTAT TCGGTGTCAC 4 00 

ACAGCGACAA TGTGGCCTGC GATTTGTTAT TTGAACTGGT GGGCGGGCCG 4 50 

CAAGCTTTGC ATGCTTATAT CCAGTCTTTA GGCGTTAAAG AAGCTGCCGT 500 

GGTAGCAAAT GAAGCGCAAA TGCATGCGGA TGATCAGGTG CAATATCAAA 550 

45 ACTGGACGTC GATGAAAGCC GCAGCACAAG TTCTGCAAAA GTTTGAACAG 600 

AAAAAGCAGT TGTCTGAAAC CTCTCAGGCC TTGTTATGGA AATGGATGGT 650 

TGAAACCACC ACAGGACCAC AGCGGTTAAA AGGCTTGTTA CCTGCTGGTA 700 

CTATAGTGGC GCATAAAACC GGTACTTCGG GCGTCAGAGC AGGAAAAACT 7 50 

GCGGCGACTA ATGATGCGGG CGTCATTATG TTGCCTGATG GACGGCCTTT 800 

50 ATTGGTGGCG GTATTTGTCA AGGATTCGGC TGAATCAGAA CGAACCAATG 850 

AAGCTATTAT TGCGCAGGTT GCGCAAGCGG CTTATCAGTT TGAGCTGAAA 900 

AAACTCTCTG CAGTGAGTCC GGATTGA 927 
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2) INFORMATION FOR SEQ ID NO: 1446 
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15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1446 
GGCCTGYGAT TTGTTATTTG AACTGGT 27 

2) INFORMATION FOR SEQ ID NO: 1447 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1447 
CGCTSTGGTC CTGTGGTGGT TTC 23 

30 

2) INFORMATION FOR SEQ ID NO: 1448 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1448 
45 GATCAGGTGC ARTATCAAAA CTGGAC 26 



2) INFORMATION FOR SEQ ID NO: 1449 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

762 



SUBSTITUTE SHEET (RULE 26) 



10 



wo 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1449 
AGCWGGTAAC AAYCCTTTTA ACCGCT 26 



2) INFORMATION FOR SEQ ID NO: 1450 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1450 
ACCACTGGGA ATACACTTGT AATGGC 26 

25. 

2 ) INFORMATION FOR SEQ ID NO: 1451 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1451 

40 ATCTACCTGG TCAATCATTG CTTCGT 26 



45 



2) INFORMATION FOR SEQ ID NO: 1452 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: BM10393 

(C) ACCESSION NUMBER: AF04 5472 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1452 

ATGACATTAT CAATAATTGT CGCTCACGAT AAACAAAGAG TCATTGGGTA 50 

CCAAAATCAA TTACCTTGGC ACTTACCT^AA TGATTTAAAG CATATTAAAC 100 

10 AACTGACCAC TGGGAATACA CTTGTAATGG CACGGAAAAC TTTTAATTCT 150 

ATAGGGAAGC CATTGCCAAA TAGACGTAAC GTCGTACTCA CT7y\CCAAGC 200 

TTCATTTCAC CATGAAGGGG TAGATGTTAT AAACTCTCTT GATGAAATTA 250 

AAGAGTTATC TGGTCATGTT TTTATATTTG GAGGACAAAC GTTATACGAA 300 

GCAATGATTG ACCAGGTAGA TGATATGTAT ATCACAGTAA TAGATGGAAA 350 

15 GTTTCAAGGA GACACATTCT TTCCACCATA CACATTCGAA AACTGGGAAG 4 00 

TCGAATCTTC AGTAGAAGGT CAACTAGATG AAAAAAATAC TATACCGCAT 4 50 

ACATTCTTAC ATTTAGTGCG TAGAAAAGGG AAATAG 486 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1453 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1453 
ATCGAAGAAT GGAGTTATCG GRAATG 26 

2) INFORMATION FOR SEQ ID NO: 1454 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1454 
CCTAAAAYTR 'CTGGGGATTT CWGGA 25 

50 



2) INFORMATION FOR SEQ ID NO: 1455 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1455 
CAGGTGGTGG GGAGATATAC AAAA 



2) INFORMATION FOR SEQ ID NO: 1456 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1456 
TATGTTAGAS RCGAAGTCTT GGKTAA 



2) INFORMATION FOR SEQ ID NO: 1457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1457 
CAAAGGTGAA CAGCTCCTGT TT 



2) INFORMATION FOR SEQ ID NO: 1458 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1458 
TCCGTTATTT TCTTTAGGTT GGTTAAA 



2) INFORMATION FOR SEQ ID NO: 1459 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1459 
AAGGTGAACA GCTCCTGTTT 



2) INFORMATION FOR SEQ ID NO: 1460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1460 
GATCACTACG TTCTCATTGT CA 



2) INFORMATION FOR SEQ ID NO: 1461 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AJ238350 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1461 

5 

GTGAAACTAT CACTAATGGT AGCTATATCG AAGAATGGAG TTATCGGGAA 50 

TGGCCCTGAT ATTCCATGGA GTGCCAAAGG TGAACAGCTC CTGTTTAAAG 100 

CTATTACCTA TAACCAATGG CTGTTGGTTG GACGCAAGAC TTTTGAATCA 150 

ATGGGAGCAT TACCCAACCG AAAGTATGCG GTCGTAACAC GTCCAAGTTT 200 

10 TACATCTGAC AATGAGAACG TAGTGATCTT TCCATCAATT AAAGATGCTT 250 

TAACCAACCT AAAGAAAATA ACGGATCATG TCATTGTTTC AGGTGGTGGG 300 

GAGATATACA AT^GCCTGAT CGATCAAGTA GATACACTAC ATATATCTAC 350 

AATAGACATC GAGCCGGAAG GTGATGTTTA CTTTCCTGAA ATCCCCAGCA 4 00 

ATTTTAGGCC AGTTTTTACC CAAGACTTCG CCTCTAACAT AAATTATAGT 450 

15 TACCAAATCT GGCAAAAGGG TTAA 474 



20 



2) INFORMATION FOR SEQ ID NO: 1462 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1462 

30 

GCACTCCCYA ATAGGAAATA CGC 23 



35 2) INFORMATION FOR SEQ ID NO: 1463 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 63 

AGTGTTGCTC AAAAACAACT TCG 23 

50 

2) INFORMATION FOR SEQ ID NO: 1464 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid' 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1464 
ACGTTYGAAT CTATGGGMGC ACT 



2) INFORMATION FOR SEQ ID NO: 1465 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1465 
GTCGATAAGT GGAGCGTAGA GGC 



2) INFORMATION FOR SEQ ID NO: 1466 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1466 
AAGCATTGAC CTACAATCAG TGT 



2) INFORMATION FOR SEQ ID NO: 1467 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1467 
AATACAACTA CATTGTCATC ATTTGAT 



2) INFORMATION FOR SEQ ID NO: 1468 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1468 
CGTTACCCGC TCAGGTTGGA CATCAA 



2) INFORMATION FOR SEQ ID NO: 1469 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1469 
CATCCCCCTC TGGCTCGATG TCG 



2) INFORMATION FOR SEQ ID NO: 1470 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 74 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: Z50804 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1470 

TTGT^AAGTAT CATTGATAGC TGCGAAACGA AAAT^CGGCG TGATTGGTTG 50 

5 CGGTCCAGAC ATACCGTGGT CCGCGAAAGG GGAGCAGCTA CTTTTTAAAG 100 

CATTGACCTA CAATCAGTGT CTTCTGGTGG GTCGCAAGAC GTTTGAATCT 150 

ATGGGCGCAC TCCCCAATAG GAAATACGCG GTCGTTACCC GCTCAGGTTG 200 

GACATCAAAT GATGACAATG TAGTTGTATT TCAGTCAATC GAAGAGGCCA 250 

TGGACAGGCT AGCTGAATTC ACCGGTCACG TTATAGTGTC TGGTGGCGGA 300 

10 GAAATTTACC GAGAAACATT ACCCATGGCC TCTACGCTCC ACTTATCGAC 350 

GATCGACATC GAGCCAGAGG GGGATGTTTT CTTCCCGAGT ATTCCAAATA 400 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA CTTCAAACAT TAACTATTGC 4 50 

TATCAAATTT GGAAAAAGGG TTAA 474 



15 



2) INFORMATION FOR SEQ ID NO: 1471 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1471 
GATAATGACA ACGT7\ATAGT ATTCCC 26 

30 



2) INFORMATION FOR SEQ ID NO: 1472 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1472 
45 GCTCAATATC AATCGTCGAT ATA 



2) INFORMATION FOR SEQ ID NO: 1473 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1473 
TTAAAGCCTT GACGTACAAC CAGTGG 26 



2) INFORiyiATION FOR SEQ ID NO: 1474 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1474 
TGGGCAATGT TTCTCTGTAA ATCTCC 26 

25 



2) INFORMATION FOR SEQ ID NO: 1475 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
40 (C) ACCESSION NUMBER: X12868 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1475 



GTGAAAGTAT CATTAATGGC TGCAAAAGCG 

45 CGGTCCACAC ATACCCTGGT CCGCGAAAGG 

CCTTGACGTA CAACCAGTGG CTTTTGGTGG 

ATGGGAGCAC TCCCTAATAG GTU^TACGCG 

GACGGCCGAT AATGACAACG TAATAGTATT 

TGTACGGGCT GGCTGAACTC ACCGATCACG 

50 GAGATTTACA GAGAAACATT GCCCATGGCC 

GATTGATATT GAGCCGGAAG GAGATGTTTT 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA 

TATCAAATTT GGCAAAAGGG TTAA 



AAAT^CGGAG TGATTGGTTG 50 

AGAGCAGCTA CTCTTTAAAG 100 

GCCGCAAGAC GTTCGAATCT 150 

GTCGTTACTC GCTCAGCCTG 200 

CCCGTCGATC GAAGAGGCCA 250 

TTATAGTGTC TGGTGGCGGG 300 

TCTACGCTCC ATATATCGAC 350 

CTTTCCGAAT ATTCCCAATA 4 00 

GCTCAAACAT TAACTATTGC 4 50 

474 
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2) INFORMATION FOR SEQ ID NO: 1476 
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10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1476 
GGCGAGCAGC TCCTATTCAA AG 22 

2) INFORMATION FOR SEQ ID NO: 1477 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1477 
TAGGTAAGCT AATGCCGATT CAACA 25 

30 



2) INFORMATION FOR SEQ ID NO: 1478 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1478 
45 GAGAATGGAG TAATTGGCTC TGGATT 26 



50 



2) INFORMATION FOR SEQ ID NO: 1479 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
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10 



40 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1479 
GCGAAATACA CAACATCAGG GTCAT 25 

2) INFORMATION FOR SEQ ID NO: 1480 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: J120 

25 (C) ACCESSION NUMBER: Z8 6002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1480 

ATGAAAATAT CTCTTATGGC AGCTGTTTCC GAGAATGGAG TAATTGGCTC 50 

30 TGGATTGGAT ATACCTTGGC ATGTACAAGG CGAGCAGCTC CTATTCAAAG 100 

CCATGACTTA CAATCAATGG CTTCTAGTTG GTCGTAAAAC CTTCGACTCA 150 

ATGGGTAAAC TTCCGAATAG AAAATATGCA GTGGTTACTC GTTCTAAAAT 200 

TATCTCGAAT GACCCTGATG TTGTGTATTT CGCAAGTGTT GAATCGGCAT 250 

TAGCTTACCT AAACAATGCG ACAGCACATA TCTTTGTTTC TGGTGGTGGT 300 

35 GAAATATATA AAGCTTTAAT CGATCAAGCA GATGTTATCC ATCTTTCAGT 350 

GATTCACAAG CATATCTCTG GCGATGTGTT TTTTCCTCCA GTTCCACAGG 400 

GCTTCAAGCA AACATTTGAG CAAAGTTTCA GTTCAAATAT TGATTACACG 450 

TACCAAATTT GGGCAAAGGG CTAA 4 74 



2) INFORMATION FOR SEQ ID NO: 1481 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1481 
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2) INFORMATION FOR SEQ ID NO: 1482 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1482 
TAATTTATAT TAGACAWAAA AAACTG 



2) INFORMATION FOR SEQ ID NO: 1483 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1483 
CARYGTCAGA AAATGGCGTA ATC 



2) INFORMATION FOR SEQ ID NO: 1484 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1484 
TKCAAAGCRW TTTCTATTGA AGGAAA 



2) INFORMATION FOR SEQ ID NO: 1485 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STEIANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1485 
AAAATGGCGT AATCGGTAAT GGC 



2) INFORMATION FOR SEQ ID NO: 1486 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1486 
CATTTGAGCT TGAAATTCCT TTCCTC 



2) INFORMATION FOR SEQ ID NO: 1487 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1487 
AATCGAAAAT ATGCAGTAGT GTCGAG 



2) INFORMATION FOR SEQ ID NO: 1488 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1488 

AGACTATTGT AGATTTGACC GCCA 24 



10 



20 



25 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1489 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: VA292 

(C) ACCESSION NUMBER: U31119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1489 



TTGAAAATTT CATTGATTTC TGCAACGTCA GAAAATGGCG TAATCGGTAA 50 

TGGCCCTGAT ATCCCATGGT CAGCAAAAGG TGAGCAGTTA CTCTTTAJ^G 100 

30 CGCTCACATA TAATCAGTGG CTCCTTGTTG GAAGGAAAAC ATTTGACTCT 150 

ATGGGTGTTC TTCCA2\ATCG AAAATATGCA GTAGTGTCGA GGAAAGGAAT 200 

TTCAAGCTCA AATGAAAATG TATTAGTCTT TCCTTCAATA GAAATCGCTT 250 

TGCAAGAACT ATCGAAAATT ACAGATCATT TATATGTCTC TGGTGGCGGT 300 

CAAATCTACA ATAGTCTTAT TGAAAAAGCA GATATAATTC ATTTGTCTAC 350 

35- TGTTCACGTT GAGGTTGAAG GTGATATCAA TTTTCCTAAA ATTCCAGAGA 400 

ATTTCAATTT GGTTTTTGAG CAGTTTTTTT TGTCTAATAT AAATTACACA 450 

TATCAGATTT GGAAAAAAGG CTAA 474 



2) INFORMATION FOR SEQ ID NO: 1490 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1490 

GACCTATGAG AGCTTGCCCG TCAAA 25 
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2) INFORMATION FOR SEQ ID NO: 1491 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
{B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1491 
TCGCCTTCGT ACAGTCGCTT AACAAA 



2) INFORMATION FOR SEQ ID NO: 1492 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1492 
CATTTTAGCT GCCACCGCCA ATGGTT 



2) INFORMATION FOR SEQ ID NO: 1493 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1493 
GCGTCGCTGA CGTTGTTCAC GAAGA 



2) INFORMATION FOR SEQ ID NO: 1494 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 bases 

(B) TYPE: Nucleic acid 

777 

SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



10 



25 



(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BL26A 

(C) ACCESSION NUMBER: U10186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1494 



ATGATCGAGC TTCATGCCAT TTTAGCTGCC ACCGCCAATG GTTGCATTGG 50 

GAAGGACAAC GCACTTCCCT GGCCACCACT AAAAGGCGAT CTGGCCAGAT 100 

15 TCAAAAAATT GACCATGGGG AAGGTGGTCA TTATGGGGCG CAAGACCTAT 150 

GAGAGCTTGC CCGTCAAATT AGAAGGTCGC ACCTGCATCG TTATGACGCG 200 

CCAAGCGCTG GAGCTTCCGG GTGTTCGTGA CGCTAACGGC GCTATCTTCG 250 

TGAACAACGT CAGCGACGCC ATGCGGTTCG CTCAAGAAGA GAGCGTGGGC 300 

GATGTGGCCT ACGTCATTGG TGGCGCTGAG ATATTCAAGC GACTTGCCTT 350 

20 GATGATCACG CAGATTGAAT TGACCTTTGT TAAGCGACTG TACGAAGGCG 400 

ACACCTACGT TGATCTGGCC GAAATGGTCA AAGACTACGA GCAGAATGGC 450 

ATGGAAGAAC ATGACCTTCA CACTTACTTC ACTTACCGTA AAAAGGAGCT 500 

TACAGAATGA 510 



2) INFORMATION FOR SEQ ID NO: 1495 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1495 
TCTCTAAACA TGATTGTCGC TGTC 24 

40 



2) INFORMATION FOR SEQ ID NO: 1496 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1496 
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CAGTGAGGCA AAAGTTTTTC TACC 



2) INFORMATION FOR SEQ ID NO: 1497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1497 
CGGACGACTT CATGTGGTAG TCAGT 



2) INFORMATION FOR SEQ ID NO: 1498 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 98 
TTTGTTTTCA GT/\ATGGTCG GGACCT 



2) INFORMATION FOR SEQ ID NO: 1499 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X57730 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1499 
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ATGGCTTCTC TAAACATGAT TGTCGCTGTC 

ATTTGAAAAT CAGATTCCGT GGCATGAACC 

AAGCTGTTAC AATGAACTCA GTTTTGATTA 

TCACTGCCTA AAGTGCTGCC CGGACGACTT 

5 AGTACCACCC ACCCAGAACA CTGATCAAGT 

AGATCGCAGT AAGAACTGCA AGCTTGTTGG 

CAAATTTTTG TAATTGGTGG GAAGAGTGCG 
CGTGGACAAA CTCTACTTAA CTAGAGTACA 

CTGAACTGGA TTTATCCCTA TTCAAGTCAT 

10 CCGACCATTA CTGAAAACAA AACAAAACTT 

CCCTAACCCT ATTAGTGAGG AACCCACATG 



PCT/CAOO/01150 

i\ATAAGACAG GAGGTATCGG 50 

AGAAGATTTA AAACACTTCA 100 

TGGGTAGAAA AACTTTTGCC 150 

CATGTGGTAG TCAGTAAAAC 200 

TGTGTATGTA AGTACATACC 250 

TTGACAAACC AGAGTATTCT 300 

TACGAGAACT TAGCTGCCTA 350 

GCTCAACACA CAACAAGACA 400 

GGAAACTCGT ATCTGAGGTC 4 50 

ATTTTCCAAA TTTGGATTAA 500 

TTAG 534 



15 2) INFORMATION FOR SEQ ID NO: 1500 

(i) SEQUENCE CHARACTERISTICS: 

(A) " LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1500 

ATCGGGTTAT TGGCAATGGT CCTA 24 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1501 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1501 
GCGGTAGTTA GCTTGGCGTG AGATT 25 

2) INFORMATION FOR SEQ ID NO: 1502 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1502 

5 

GCGGGCGGAG CTGAGATATA CA 22 

10 2) INFORMATION FOR SEQ ID NO: 1503 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1503 

AACGGAGTGG GTGTACGGAA TTACAG 26 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1504 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 98 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coll 

(B) STRAIN: TKS84 

(C) ACCESSION NUMBER: Z21672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1504 



ATGAACTCGG AATCAGTACG CATTTATCTC GTTGCTGCGA TGGGAGCCAA 50 

TCGGGTTATT GGCAATGGTC CTAATATCCC CTGGAAAATT CCGGGTGAGC 100 

45 AGAAGATTTT TCGCAGACTC ACTGAGGGAA AAGTCGTTGT CATGGGGCGA 150 

AAGACCTTTG AGTCTATCGG CAAGCCTCTA CCGAACCGTC ACACATTGGT 200 

AATCTCACGC CAAGCTAACT ACCGCGCCAC TGGCTGCGTA GTTGTTTCAA 250 

CGCTGTCGCA CGCTATCGCT TTGGCATCCG AACTCGGCAA TGAACTCTAC 300 

GTCGCGGGCG GAGCTGAGAT ATACACTCTG GCACTACCTC ACGCCCACGG 350 

50 CGTGTTTCTA TCTGAGGTAC ATCAAACCTT CGAGGGTGAC GCCTTCTTCC 4 00 

C7VATGCTCAA CGAAACAGAA TTCGAGCTTG TCTCAACCGA AACCATTCAA 4 50 

GCTGTAATTC CGTACACCCA CTCCGTTTAT GCGCGTCGAA ACGGCTAA 4 98 
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2) INFORMATION FOR SEQ ID NO: 1505 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1505 
ATTTTTCGCA GGCTCACCGA GAGC 



2) INFORMATION FOR SEQ ID NO: 1506 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1506 
CGGATGAGAC AACCTCGAAT TCTGCTG 



2) INFORMATION FOR SEQ ID NO: 1507 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: RA33.2 

(C) ACCESSION NUMBER: Z50802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1507 

ATGAACCCGG AATCGGTCCG CATTTATCTG GTCGCTGCCA TGGGTGCCAA 50 

TCGGGTTATT GGCAATGGTC CCGATATCCC CTGGAAAATC CCAGGTGAGC 100 

AGAAGATTTT TCGCAGGCTC ACCGAGAGCA AAGTGGTCGT TATGGGCCGC 150 

AAGACATTTG AGTCCATAGG CAAGCCCTTA CCAAACCGCC ACACAGTGGT 200 
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GCTCTCGCGC CAAGCTGGTT ATAGCGCTCC TGGTTGTGCA GTTGTTTCAA 250 

CGCTGTCACA CGTATCGCCA TCGACAGCCG AACACGGCAA AGAACTCTAC 300 

GTAGCGCGCG GAGCCGAGGT ATATGCGCTG GCGCTACCGC ATGCCAACGG 350 

CGTCTTTCTA TCTGAGGTAC ATCAAACCTT TGAGGGTGAC GCCTTCTTCC 400 

CAGTGCTTAA CGCAGCAGAA TTCGAGGTTG TCTCATCCGA AACCATTCAA 4 50 

GGCACAATCA CGTACACGCA CTCCGTCTAT GCGCGTCGTA ACGGCTAA 498 



10 2) INFORMATION FOR SEQ ID NO: 1508 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1508 

AGAATGTATT GGTATTTCCA TCTATCG 27 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1509 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1509 
CAATGTCGAT TGTTGAAATA TGTAAA 26 

2) INFORMATION FOR SEQ ID NO: 1510 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1510 
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TGGAGTGCCA AAGGGGAACA AT 
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5 2) INFORMATION FOR SEQ ID NO: 1511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1511 

CAGACACAAT CACATGATCC GTTATCG 27 



20 



30 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1512 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: UI14 

(C) ACCESSION NUMBER: Z 8 3331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1512 



GTGAAACTAT CACTAATGGC AGCAATTTCG AAGAATGGAG TTATCGGAAA 50 

TGGCCCAGAT ATTCCATGGA GTGCCAAAGG GGAACAATTA CTCTTCAAAG 100 

40 CGATTACCTA TAATCAGTGG CTTTTGGTAG GCCGAAAGAC TTTCGAGTCA 150 

ATGGGGGCTT TACCCAACCG AAAATATGCC GTTGTAACTC GTTCAAGCTT 200 

CACTTCCAGT GATGAGAATG TATTGGTATT TCCATCTATC GATGAAGCGC 250 

TAAATCATCT GAAGACGATA ACGGATCATG TGATTGTGTC TGGTGGTGGT 300 

GAAATATACA AAAGCCTGAT CGATAAAGTT GATACTTTAC ATATTTCAAC 350 

45 AATCGACATT GAGCCAGAAG GTGATGTCTA TTTTCCAGAA ATCCCCAGTA 400 

GTTTTAGGCC AGTTTTTAGC CAAGACTTCG TGTCTAACAT AAATTATAGT 450 

TACCAAATCT GGCAAAAGGG TTAA 474 



2) INFORMATION FOR SEQ ID NO: 1513 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1513 
10 TTCAAGCTCA AATGAAAACG TCC 23 



15 



2) INFORMATION FOR SEQ ID NO: 1514 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1514 

25 

GAAATTCTCA GGCATTATAG GGAAT 25 



30 2) INFORMATION FOR SEQ ID NO: 1515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1515 

GTGGTCAGTA AAAGGTGAGC AAC 23 



45 

2) INFORMATION FOR SEQ ID NO: 1516 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear ' 
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CTATGTCTCA AGGCCGTGCA ACATACTCTA TGGAATTTGC TAAATATGCT 
GAAACTCCAC GTAACGTGGC TGAAGGCATC ATTTCTAAAT TTCAGTCTGG 
CGGTAAAAAA GGTGACGACG AGTAA 



50 
100 
125 



2) INFORMATION FOR SEQ ID NO: 1519 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter Iwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1519 

TCTTTCGATT ACTATAAGCC CAAACTAATT CATAGTTAAA AACCAAGTGC 50 
TCATGCAGTG ATCCTGCATG AGTAGTTTAA AAAGGAAGAT CTC 93 



2) INFORMATION FOR SEQ ID NO: 1520 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STEU^NDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aclnetohacter Iwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1520 

ATGGCTAAGG CTAAGTTTGA ACGTAATAAG CCACACGTTA ACGTGGGCAC 50 

AATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCAATTGCAA 100 

CTGTATGTGC GAAGAAATTC GGTGGCGAAG CGAAAGACTA CGCTGCAATT 150 

GACTCTGCAC CAGAAGAAAA AGCACGTGGT ATTACAATTA ATACTTCACA 200 

CGTAGAATAC GATTCTCCAA CTCGTCACTA CGCACACGTA GACTGCCCGG 250 

GCCACGCCGA TTATGTTAAA AACATGATTA CTGGTGCTGC TCAGATGGAC 300 

GGCGCGATCC TTGTATGTGC TGCGACTGAT GGTCCAATGC CACAGACTCG 350 

TGAACACATC CTTCTTTCTC GTCAGGTTGG TGTACCTTAC ATTCTTGTAT 400 

TCCTTAACAA GTGTGACCTT GTTGATGATG AAGAACTTCT TGAGCTAGTG 450 
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GAAATGGAAG TTCGTGAACT TCTTTCTACT 

CACTCCAGTT ATCCGTGGTT CAGCTCTTCT 

GTCAGTATGG CGAAGAAGCA GTTGTTGCGC 

TACATTCCAG AGCCAGTACG TGCAATCGAC 

5 CGAAGACGTA TTCTCTATTT CTGGTCGTGG 

TAGAAACTGG TATTGTGAAA GTAGGCGAAT 

CGTGATACTC AAGTAACTAC AGTTACTGGC 

GCTTGACGAA GGTCGTGCGG GCGAGAACTG 

CTAAGCGTGA AGACGTACAA CGTGGTCAAG 

10 ATCAAGCCAC ACACTAAATT CGATGCAGAA 

AGAAGGTGGT CGTCACACTC CATTCCTTAA 

ACTTCCGTAC AACTGACGTA ACTGGCGCGA 

GAAATGGTTA TGCCTGGTGA CAACGTAGAA 
CCCAAT 

15 



PCT/CAOO/01150 

TATGACTTCC CAGGTGATGA 500 

TGCACTTAAC GGTGACGCTG 550 

TTGTTGACGC ACTTGACACT 600 

CAAGCATTCT TAATGCCAAT 650 

TACAGTAGTA ACTGGCCGTG 700 

CAGTTGAAAT CGTTGGTATC 7 50 

GTAGAAATGT TCCGTAAATT 800 

TGGTGTTCTT CTACGTGGTA 850 

TACTTGCTAA ACCAGGTGCA 900 

GTATACGTAC TTTCTAAAGA 950 

CGGTTACCGT CCACAGTTCT 1000 

TCAAATTACA AGATGGCGTT 1050 

ATGTCAGTAG AATTAATCCA 1100 

1106 



2) INFORMATION FOR SEQ ID NO: 1521 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 
30 (B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1521 

ACAAACTCAA GGTCGTGCAT CTTACTCAAT GGAACCGTTA AAATATGCTG 50 
3 5 AAGCTCCAAC AAGTGTTGCG GCTGCAGTAA TTGAAGCGCG TAAAAAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1522 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1522 
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TTTTTGTAAA CCAGCGGTGT AAAATATGAT TGTTTTATAC CGCACTTCTT 50 
AGGAAACATT AGAA 64 



2) INFORiyiATION FOR SEQ ID NO: 1523 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1523 



ATGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTAA ACGTGGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC TTT7VACAGCA GCAATTACAA 100 

25 CCGTATTAGC AAAACACTAC GGTGGTGCAG CGCGTGCATT TGACCAAATC 150 

GATAACGCGC CAGAAGAAAA AGCGCGTGGT ATTACCATCA ACACTTCACA 200 

TGTTGAATAC GATACACCAA CTCGCCACTA TGCACACGTA GACTGTCCAG 250 

GACACGCCGA CTATGTTAAA AACATGATTA CCGGTGCGGC GCAAATGGAT 300 

GGTGCTATTT TAGTAGTAGC AGCAACAGAT GGTCCTATGC CACAAACTCG 350 

3 0 TGAACATATC TTATTAGGTC GCCAAGTAGG TGTTCCATAC ATCATCGTAT 400 

TCTTAAACAA ATGCGACATG GTAGATGATG AAGAGTTATT AGAATTAGTA 450 

GAAATGGAAG TGCGTGAACT TCTATCTCAA TATGACTTCC CAGGTGACGA 500 

TACACCAATC GTACGTGGTT CAGCATTACA AGCATTGAAC GGCGTAGCAG 550 

AATGGGAAGA AAAAATCCTT GAATTAGCTG GTCACTTAGA TACTTACATC 600 

35 CCAGAACCAG AACGTGCGAT TGACCAACCG TTCCTTCTTC CAATTGAAGA 650 

CGTATTCTCA ATTTCAGGTC GTGGTACAGT AGTAACTGGT CGTGTAGAAC 700 

GTGGTATCAT CCGTACTGGT GATGAAGTAG AAATCGTCGG TATCAAAGAT 750 

ACAGCGAAAA CTACTGTAAC AGGTGTTGAA ATGTTCCGTA AATTACTTGA 800 

CGAAGGTCGT GCAGGTGAAA ACATCGGTGC ATTATTACGT GGTACCAAAC 850 

40 GTGAAGAAAT CGAACGTGGT CAAGTATTAG CGAAACCAGG TTCAATCACA 900 

CCACACACTG ATTTTGT^TC AGAAGTATAC GTATTATCAA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGTTA CCGTCCACAA TTCTATTTCC 1000 

GTACAACAGA CGTAACTGGT ACAATTGAAT TACCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAGATGACA GTAAGCTTAA TCCACCCA 1098 

45 



2) INFORMATION FOR SEQ ID NO: 1524 

50 (i) SEQUENCE CHARACTERISTICS : ^ 

(A) LENGTH: 77 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1524 

CAATGGAGTT CTTGAAGTAC AACGAAGCGC CTAGCAACGT CGCTCAGGCT 50 
AT.TATCGAAG CTCGTAAAGC GAAATAA 77 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 1525 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 67 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1525 

GATCCTTTCG AGTTCAATTT AGTTTACGCT CCCTCTGTGA GAGGGAGCGA 50 
TATTAAGGAA TATAGTC 67 

35 

2) INFORMATION FOR SEQ ID NO: 1526 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1112 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1526 

GTGTCTAAAG AAAAATTTGA ACGTTCAAAA CCGCACGTTA ACGTTGGTAC 50 
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TATCGGCCAC GTTGACCACG GTAAAACAAC 
CAGTTTTAGC TAAAACTTAC GGTGGTGCTG 
GATAATGCAC CAGAAGAAAA AGCGCGTGGT 
CGTAGAATAC GATACTCCAA CTCGCCACTA 
GTCACGCCGA CTATGTTAAA AACATGATCA 
GGCGCTATTC TGGTAGTAGC AGCAACTGAT 
TGAGCACATC CTGTTAGGTC GTCAGGTTGG 
TCCTGAACAA ATGTGACATG GTAGATGATG 
GAAATGGAAG TTCGTGAACT TCTGTCTCAA 
CACTCCAGTA ATCCGTGGTT CAGCGCTGAA 
AGTGGGAAGC AT^AAATTGTT GAATTAGCAG 
CCAGAGCCAG AGCGTGCAAT TGACAAACCA 
TGTATTCTCA ATCTCAGGCC GTGGTACAGT 
GTGGTATCAT C7y\AGTAGGT GATGAAGTTG 
ACCACCAAAA CAACTTGTAC TGGCGTTGAA 
CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT 
GTGAAGAAAT CGAACGTGGA CAAGTACTGG 
CCACACAACA AATTTGAATC AGAAGTTTAT 
TGGTCGTCAC ACACCATTCT TCAAAGGCTA 
GTACAACTGA CGTAACTGGT ACTATCGAAT 
GTAATGCCAG GCGACAACGT GAACATGATC 
CGCAATGGAC GA 



PCT/CAOO/01150 

TCTGACTGCT GCAATCACTA 100 

CTCGTGCATT CGACCAAATC 150 

ATCACCATCT CTACTTCACA 200 

CGCACACGTA GACTGCCCAG 250 

CTGGTGCTGC GCAAATGGAC 300 

GGTCCAATGC CACAAACTCG 350 

TGTTCCTTAC ATCATCGTAT 400 

AAGAGCTGTT AGAATTAGTT 450 

TACGATTTCC CAGGTGATGA 500 

AGCACTGGAA GGCGAAGCAG 550 

AAGCACTGGA TTCTTATATC 600 

TTCCTGTTAC CAATCGAAGA 650 

AGTTACTGGT CGTGTAGAGC 700 

AGATTGTTGG TATCAAAGAA 750 

ATGTTCCGTA AATTACTTGA 800 

TCTGCTGCGT GGTACAAAAC 850 

CAAAACCAGG CTCAATCAAC 900 

ATTCTGAGCA AAGATGAAGG 950 

CCGTCCACAG TTCTACTTCC 1000 

TACCAGAAGG CGTAGAAATG 1050 

GTTGAACTGA TCCACCCAAT 1100 

1112 



2) INFORMATION FOR SEQ ID NO: 1527 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvus 

(B) STRAIN: ATCC 35224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1527 

ATCAACGAAG CTATCGAGGT TTATTTTGAG GTTGAGGGCA AGAAAAATAG 50 

ATTGATCCTG GAGGTCGCGG CTCACTTGGG TGATAACCGC GTCAGAACGA 100 

TCGCTATGGA TATGAGTGAG GGGCTTACTC GCGGGCTTGA AGCTACCGCT 150 

CTTGGTGCGC CTATTAGTGT GCCGGTTGGC GAGAAGGTTT TGGGAAGAAT 200 

TTTTAACGTC GTCGGCGATC TCATCGACGA GGGCGAGGGC GTAAATTTTG 250 

ATAAACATTG GTCTATCCAC CGCGATCCGC CACCATTTGA AGAACAAAGC 300 

ACGAAAAGTG AAATTTTTGA AACCGGTATA AAGGTTGTGG ATCTTCTTGC 350 

GCCTTACGCA AAGGGCGGTA AGGTCGGACT ATTTGGCGGT GCAGGTGTCG 400 

GTAAAACGGT CATCATCATG GAGCTCATCC ACAATGTCGC CTTCAAACAC 4 50 

AGCGGATACT CTGTATTTGC AGGTGTTGGC GAGAGGACGC GCGAAGGAAA 500 

CGACCTTTAT CACGAGATGA AAGAAAGTAA CGTTTTGGAT AAAGTCGCCT 550 

TGTGCTACGG ACAGATGAAC GAGCCGCCAG GGGCGAGAAA TCGTATCGCA 600 
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CTGACTGGTC TAACGATGGC TGAGTATTTC 
TGTGCTTATG TTTATCGACA ACATCTTCCG 
AGATGTCGGC ACTCCTCGGA CGTATCCCAT 
ACGCTGGCAA GCGAGATGGG TAAATTTCAA 



CGCGATGAGA TGGGACTTGA 650 

CTTCTCTCAA TCTGGTGCAG 700 

CAGCCGTTGG TTACCAGCCG 750 

GAAAGGATCA CATCGACTAA 800 



2) INFORMATION FOR SEQ ID NO: 1528 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 1528 
AACTTGAGCG ATTTTCGGAT ACCCTG 



2) INFORMATION FOR SEQ ID NO: 1529 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1529 
TTGCCGATGA AATAACCGCC GACT 



2) INFORMATION FOR SEQ ID NO: 1530 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: M11277 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1530 



ATGCGATTGG TTTGGAAATG TGGGGCGATT CAGGCATCCC GGTTATCTGA 50 

ATGGCTCAAC TC;y\CAGCCG GTGCTCATGA ACTTGAGCGA TTTTCGGATA 100 

5 CCCTGACCTT TTCTGTGTAT GGCTCAGTGC TGATCTGGCT GAAATCATAT 150 

CTCCGCGAAT CAGGAAGAAA ACTGCAGTTA GTCGGAATCG CCTTACCCAA 200 

CACCCTGAAC CCAAGGGACG ACCTAGCGCA ATTGGCCGAA ATTATCCAGC 250 

TCATCGATCA CCTCATGAAA CCGCACGTTG ATATGTTGAC TCACTTGTTG 300 

GCGTCCATTG ATGGCCAGTC GGCGGTTATT TCATCGGCAA AATGGGGGGA 350 

10 GCTAGAAACG GCTCGGCAGG AGAAAGCTAT CTCAGGGGTA ACCAGATTGA 400 

AGCTCCGCTT GGCGTCGCTT GCCCCCGTCC TGAAAAAACA CGTCAACAGC 450 

GATTTGTTCC GAAAAGCCTC TGATCGAATA GAGTCGATAG AGTATACGTT 500 

GGAAACCTTG CGTATAATGA AAACTTTCTT CGATGGTACC TCTCTTGAGG 550 

GAGATACTTC CGTACGTGAC TCGTATATGG CGGGCGTAGT AGATGGAATG 600 

15 GTTCGAGCGA ATCCGGATGT GAAGATAATT CTGCTGGCGC ACAACAATCA 650 

TCTACAAAAA ACTCCAGTCT CCTTTTCAGG CGAGCTTACG GCTGTTCCCA 700 

TGGGGCAGCA CCTCGCAGAG AGGGTGAATT ACCGTGCGAT TGCATTCACC 750 

CATCTTGGAC CCACCGTGCC GGAAATGCAT TTCCCATCGC CAAAAAGTCC 800 

TCTTGGATTC TCTGTTGTGA CCACGCCTGC CGATGCAATC CGTGAGGATA 850 

20 GTATGGAACA GTATGTCATC GACGCCTGTG GTACGGAGAA TTCATGTCTG 900 

ACATTGACAG ATGCCCCCAT GGTU^GCAAAG CGAATGCGGT CTCAAAGCGC 950 

CTCTGTAGAA ACGAAATTGA GCGAGGCATT TGATGCCATC GTCTGTGTTA 1000 

CAAGCGCCGG CAAGGACAGC CTGGTTGCCC TAT AG 1035 



25 



2) INFORMATION FOR SEQ ID NO: 1531 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1531 
TCTTTTTGTT ACGACATACG CTTTT 25 

40 



2) INFORMATION FOR SEQ ID NO: 1532 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1532 
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2) INFORMATION FOR SEQ ID NO: 1533 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



15 



20 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1533- 
CAGCGGATAA AGAAGCACTA CACATT 26 

2} INFORMATION FOR SEQ ID NO: 1534 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1534 
CCTCCTGAAA TAAAGCCCGA CAT 23 

35 



2) INFORMATION FOR SEQ ID NO: 1535 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
50 (C) ACCESSION NUMBER: A15097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1535 
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ATGAGGTTCG AAGAATGGGT CAAAGATAAG CATATTCCTT NCAAACNGAA 
TCACCCTGAT GATAATTACG ATGATTTTAA GCCATTAAGA AANATAATTG 
GAGATACCCG AGTTGTAGCA TTAGGTGAAA ATTCTCATTT CATAAAAGAA 
TTCTTTTTGT TACGACATAC GCTTTTGCGT TTTTTTATCG AAGATCTAGG 
5 TTTTACTACG TTTGCTTTTG AATTTGGTTT TGCTGAGGGT CAAATCATCA 
ATAACTGGAT ACATGGACAA GGAACTGACG ATGAAATAGG CAGATTCTTA 
AAACACTTCT ATTATCCAGA AGAGCTCAAA ACCACATTTC TATGGCTAAG 
GGAGTACAAT AAAGCAGCAA AAGAAAAAAT CACATTTCTT GGCATTGATA 
TACCCAGAAA TGGAGGTTCA TACTTACCAA ATATGGAGAT AGTGCATGAC 

10 TTTTTTAGAA CAGCGGATAA AGAAGCACTA CACATTATCG ATGATGCATT 
TAATATTGCA AAAAAGATTG ATTACTTCTC CACATCACAG GCAGCCTTAA 
ATTTACATGA GCTAACAGAT TCTGAGAAAT GCCGTTTAAC TAGCCAATTA 
GCTCGAGTAA AAGTTCGCCT TGAAGCTATG GCTCCAATTC ACATTGAAAA 
ATATGGGATT GATAAATATG AGACAATTCT GCATTATGCC AACGGTATGA 

15 TATACTTGGA CTATAACATT CAAGCTATGT CGGGCTTTAT TTCAGGAGGC 
GGAATGCAGG GCGATATGGG TGCAAAAGAC AAATACATGG CAGATTCTGT 
GCTGTCSGCAT TTAAAAAACC CACAAAGTGA GCAGAAAGTG ATAGTAGTAG 
CACATAATGC ACATATTCAA AAAACACCCA TTCTGTATGA TGGATTTCTA 
AGTTGCCTAC CAATGGGCCA AAGACTTAAA AATGCCATTG GTGATGATTA 

20 TATGTCTTTA GGTATTACTT CTTATAGTGG GCATACTGCA GCCCTCTATC 
CGGAAGTTGA TACAAAATAT GGTTTTCGAG TTGATAACTT CCAACTGCAG 
GAACCAAATG AAGGTTCTGT CGAGAAAGCT ATTTCTGGTT GTGGAGTTAC 
TAATTCTTTT GTCTTTTTTA GAAATATTCC TGAAGATTTA CAATCCATCC 
CG/^CATGAT TCGATTTGAN TCTATTTACA TGAAAGCAGA ACTCGAGAAA 

25 GCTTTCGATG GAATATTTCA AATTGAAAAG TCATCTGTAT CTGAGGTCGT 
TTATGAATAA 



30 2) INFORMATION FOR SEQ ID NO: 1536 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1260 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



35 



(ii) 



MOLECULE TYPE: DNA 



40 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1536 



AGATGTATTA ACTGGAAAAC AACAA 



45 



2) INFORMATION FOR SEQ ID NO: 1537 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1537 
5 CTTTGTAATT AGTTTCTGAA AACCA 25 



10 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1538 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1538 

20 

TTAGAAGATA TAGGATACAA AATAGAAG 28 

25 2) INFORMATION FOR SEQ ID NO: 1539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1539 

GAATGAAAAA GAAGTTGAGC TT 22 



2) INFORMATION FOR SEQ ID NO: 1540 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 86 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
(C) ACCESSION NUMBER: M14039 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1540 

ATGAAAAATA ATAATGTAAC AGAAAAAGAA TTATTTTATA TTTTAGATTT 50 

5 ATTTGAACAC ATGAAAGTAA CTTATTGGTT AGATGGTGGC TGGGGGGTAG 100 

ATGTATTAAC TGGAAAACAA CAAAGAGAAC ACAGAGATAT AGATATAGAT 150 

TTTGACGCTC AACACACTCA AAAAGTTATA CAAAAATTAG AAGATATAGG 200 

ATACAAAATA GAAGTTCATT GGATGCCTTC ACGTATGGAA CTTAAGCATG 250 

AAGAATATGG GTATTTAGAT ATTCATCCTA TAAATCTAAA TGATGATGGA 300 

10 TCAATTACCC AAGCAAACCC AGAAGGTGGT AATTATGTTT TCCAAAATGA 350 

CTGGTTTTCA GAAACTAATT ACAAAGATCG AAAAATACCA TGTATTTCAA 400 

AAGAAGCTCA ACTTCTTTTT CATTCTGGTT ATGATTTAAC AGAAACAGAC 4 50 

CATTTTGATA TAAAAAATTT AAAATCAATA ACATAA 48 6 



15 



50 



2) INFOEIMATION FOR SEQ ID NO: 1541 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1541 
TGATAATCTT ATACGTGGGG AATTT ' 25 

30 

2) INFORMATION FOR SEQ ID NO: 1542 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid. 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1542 
45 ATAATTTTCT AATTGCCCTG TTTCAT 26 



2) INFORMATION FOR SEQ ID NO: 1543 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1543 
GGGCAATTAG AAAATTATTT ATCAGA 26 



2) INFORMATION FOR SEQ ID NO: 1544 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1544 
TTTTACTCAT GTTTAGCCAA TTATCA 26 

25 



2) INFORMATION FOR SEQ ID NO: 1545 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
40 (C) ACCESSION NUMBER: AF110130 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1545 

ATGTTAAAAC AAAAAGAATT AATTGCAAAC GTTAAGAATC TTACTGAGTC 50 

45 AGATGAACGA ATTACAGCTT GTATGATGTA TGGATCGTTT ACCAAAGGAG 100 

AAGGTGACCA ATACTCTGAT ATAGAGTTCT ATATATTTTT GAAACATAGT 150 

ATAACCTCGA ACTTTGATTC ATCCAACTGG TTGTTTGACG TAGCTCCGTA 200 

CTTGATGCTT TATAAAAATG AGTACGGAAC AGAGGTAGTT ATTTTTGATA 250 

ATCTTATACG TGGGGAATTT CATTTCCTTT CTGAAAAAGA TATGAACATA 300 

50 ATCCCCTCGT TTAAAGATTC AGGTTATATT CCTGATACGA AGGCTATGCT 350 

TATTTACGAT GAAACAGGGC AATTAGAAAA TTATTTATCA GAGATAAGTG 400 

GTGCAAGACC AAATAGACTT ACTGAAGAAA ATGCTAATTT TTTGTTGTGT 450 

AATTTCTCTA ATCTATGGTT GATGGGAATC AACGTTCTAA AAAGAGGAGA 500 
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ATATGCTCGT TCATTAGAAC 
AACTTATACG TATGGCAGAA 
AAAAACCTTG AAAAAGAAAT 
GACCACTGCT CGATTAGATA 
CTTTGCTATT AGTTATGGAT 
TTAAAAGTTA CACATGACAT 
ATAG 



TCTTATCACA ACTTCAAAAA AATACACTAC 
AAAAATGCTG ATAATTGGCT AAACATGAGT 
TAGCCTTGAA AATTATAAAA AATTTGCAAA 
AGGTAGAATT ATTTGAAGCC TATAAAAATT 
TTGCAAAGTC ACCTTATTGA ACAATACAAC 
TTTAGAAAGA TTGTTGAATT ACATTAGTGA 



550 
600 
650 
700 
750 
800 
804 



2) INFORMATION FOR SEQ ID NO: 1546 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1546 
CAAGAAGGAA TGGCTGTACT AC 



2) INFORMATION FOR SEQ ID NO: 1547 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1547 
TAATTCCCAA ATAACCCTAA TAATAGA 



2) INFORMATION FOR SEQ ID NO: 1548 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: U70055 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 1548 



5 



ATGGAAAAAT ACAACAATTG GAAACTTAAG TTTTATACAA TATGGGCAGG 
GCAAGCAGTA TCATTAATCA CTAGTGCCAT CTTGCAAATG GCGATTATTT 
TTTACCTTAC AGAAAAAACT GGATCCGCGA TGGTCTTGTC TATGGCTTCA 
CTATTAGGTT TTTTACCCTA TGCGGTCTTT GGACCTGCAA TTGGTGTGCT 

10 AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCT GATTTAATTA 
TCGCAGCAGC TGGTTCGGTG CTTACTATTG TTGCATTCTA TATGGAGCTA 
CCTGTCTGGA TGGTTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 
TTTTCACACC CCGGCTCTCA ATGCGGTTAC GCCACTTTTA GTACCAGAAG 
AACAGCTTAC GAAATGTGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 

15 TATATTGTTA GTCCGGCGGT TGCAGCACTC TTATACTCCG TTTGGGAACT 
AAATGCTATT ATTGCCATCG ATGTATTGGG TGCTGTGATT GCATCTATTA 
CGGTAGCAAT TGTACGTATT CCTAAGCTGG GTGATCGCGT GCAAAGTTTG 
GACCCAAATT TCATAAGAGA AATGCAAGAA GGAATGGCTG TACTACGGCA 
AAATAAAGGA TTATTTGCTT TATTACTCGT TGGAACATTA TATATGTTTG 

20 TTTATATGCC AATTAATGCA CTATTCCCTT TAATTAGCAT GGATTACTTT 
AATGGAACAC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATCTGG 
AATGTTGATA GGGGGTCTAT TATTAGGGTT ATTTGGGAAT TACCAAAAGC 
GAATCTTATT AATAACGGCA TCCATTTTTA TGATGGGGAT AAGCTTAACC 
ATTTCAGGAT TACTTCCCCA AAGTGGATTT TTCATTTTTG TAGTCTGCTG 

25 TGCAATAATG GGGCTTTCTG TTCCGTTTTA CAGCGGTGTG CAAACAGCTC 
TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTA 
ACTGGAAGTA TCATGTCTCT TGCTATGCCA ATTGGATTAA TTCTTTCTGC 
ACTCTTTGCT GATAGAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 
CTTTT^TTAT TTGCATTGCA ATAGTTTGCC CAATGATAAA TGAGATTAGA 

30 AAATTAGATT TAAAATAA 



2) INFORMATION FOR SEQ ID NO: 1549 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1218 



35 



40 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1549 



45 



GCTTATTATT AGGAAGATTA GGGGGC 



50 



2) INFORMATION FOR SEQ ID NO: 1550 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1550 
TAGCAAGTGA CATGATACTT CCGA 



2) INFORMATION FOR SEQ ID NO: 1551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
(C) ACCESSION NUMBER: U83667 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1551 

ATGGAAAAAT ACAACAATTG GAAACGAAAA TTTTATGCAA TATGGGCAGG 50 

GCAAGCAGTA TCATTAATCA CTAGTGCCAT CCTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACA GGATCTGCGA TGGTCTTGTC TATGGCTTCA 150 

TTAGTAGGTT TTTTACCCTA TGCGATTTTG GGACCTGCCA TTGGTGTGCT 200 

AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCC GATTTAATTA 250 

TCGCAGCAGC TGGTGCAGTG CTTGCTATTG TTGCATTCTG TATGGAGCTA 300 

CCTGTCTGGA TGATTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCATACC CCAGCACTCA ATGCGGTTAC ACCACTTTTA GTACCAGAAG 400 

AACAGCTAAC GAAATGCGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 450 

TATATTGTTA GTCCGGCAGT TGCAGCACTC TTATACTCCG TTTGGGATTT 500 

AAATGCTATT ATTGCCATCG ACGTATTGGG TGCTGTGATT GCATCTATTA 550 

CGGTAGCAAT TGTACGTATA CCTAAGCTGG GTAATCAAGT GCAAAGTTTA 600 

GAACCAAATT TCATAAGGGA GATGAAAGAA GGAGTTGTGG TTCTGAGACA 650 

AAACAAAGGA TTGTTTGCCT TATTACTCTT AGGAACACTA TATACTTTTG 700 

TTTATATGCC AATCAATGCA CTATTTCCTT TAATAAGCAT GGAACACTTT 7 50 

AATGGAACGC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATTTGG 800 

GATGCTAGCA GGAGGCTTAT TATTAGGAAG ATTAGGGGGC TTCGAAAAGC 850 

ATGTATTACT AATAACAAGT TCATTTTTTA TAATGGGGAC CAGTTTAGCC 900 

GTTTCGGGAA TACTTCCTCC AAATGGATTT GTAATATTCG TAGTTTGCTG 950 

TGCAATAATG GGGCTTTCGG TGCCATTTTA TAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTG 1050 

ATCGGAAGTA TCATGTCACT TGCTATGCCA ATTGGGTTAA TTCTTTCTGG 1100 

ATTCTTTGCT GATAAAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

TTTTAATTAT TGGCATTGCT ATAGTTTGCC AAATGATAAC TGAGGTTAGA 1200 

AAATTAGATT TAAAATAA 1218 

801 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1552 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1552 
GGCAAGCAGT ATCATTAATC ACTA 



2) INFORMATION FOR SEQ ID NO: 1553 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1553 
CAATGCTACG GATAAACAAT ACTATC 



2) INFORMATION FOR SEQ ID NO: 1554 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1554 
AGAAAATTAA GCCTGAATAT TTAGGAC 



2) INFORMATION FOR SEQ ID NO: 1555 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1555 
TAGTAAAAAC CAATGATTTA CACCG 25 



2) INFORMATION FOR SEQ ID NO: 1556 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1556 
ACTGTACGCA CTTGCAGCCC GACAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1557 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1557 
4 0 GAACGGCAGG CGATTCTTGA GCAT 24 



2) INFORMATION FOR SEQ ID NO: 1558 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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GTGGTGGTGC ATGGCGATCT CT 
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2) INFORMATION FOR SEQ ID NO: 1559 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1559 
GCCGCAGCGA GGTACTCTTC GTTA 



2) INFORMATION FOR SEQ ID NO: 1560 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: D16251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1560 

ATGACCGTAG TCACGACCGC CGATACCTCC CAACTGTACG CACTTGCAGC 50 

CCGACATGGG CTCAAGCTCC ATGGCCCGCT GACTGTCAAT GAGCTTGGGC 100 

TCGACTATAG GATCGTGATC GCCACCGTCG ACGATGGACG TCGGTGGGTG 150 

CTGCGCATCC CGCGCCGAGC CGAGGTAAGC GCGAAGGTCG AACCAGAGGC 200 

GCGGGTGCTG GCAATGCTCA AGAATCGCCT GCCGTTCGCG GTGCCGGACT 250 

GGCGCGTGGC CAACGCCGAG CTCGTTGCCT ATCCCATGCT CGAAGACTCG 300 

ACTGCGATGG TCATCCAGCC TGGTTCGTCC ACGCCCGACT GGGTCGTGCC 350 

GCAGGACTCG GAGGTCTTCG CGGAGAGCTT CGCGACCGCG CTCGCCGCCC 400 

TGCATGCCGT CCCCATTTCC GCCGCCGTGG ATGCGGGGAT GCTCATCCGT 450 

ACACCGACGC AGGCCCGTCA GAAGGTGGCC GACGACGTTG ACCGCGTCCG 500 

ACGCGAGTTC GTGGTGAACG ACAAGCGCCT CCACCGGTGG CAGCGCTGGC 550 

TCGACGACGA TTCGTCGTGG CCAGATTTCT CCGTGGTGGT GCATGGCGAT 600 

CTCTACGTGG GCCATGTGCT CATCGACAAC ACGGAGCGCG TCAGCGGGAT 650 

GATCGACTGG AGCGAGGCCC GCGTTGATGA CCCTGCCATC GACATGGCCG 700 

CGCACCTTAT GGTCTTTGGT GAAGAGGGGC TCGCGAAGCT CCTCCTCACG 750 
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TATGAAGCGG CCGGTGGCCG 
GGAGCGCCTT GCGTTCGGGG 
CGGGTAACGA AGAGTACCTC 
GAATGA 



GGTGTGGCCG CGGCTCGCCC 
CGGTCACCTA CGCACTCTTC 
GCTGCGGCGA AGGCGCAGCT 
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ACCACATCGC 8 00 

GCCCTCGACT 850 
CGCCGCAGCG 900 

906 



2) INFORMATION FOR SEQ ID NO: 1561 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1048 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1561 



GATCATGGTA AAACTACATT GACTGCTGCT 
ACAAGGTGGT GCCAACTTCT TGGATTAYGG 
AAGAAAGAGC TAGAGGTATC ACTATTTCCA 
ACCAAGAACA GACACTATGC CCACGTTGAT 
TATCAAAAAT ATGATTACTG GTGCCGCTCA 
TTGTTGCTGC CACTGATGGT CAAATGCCTC 
TTGGCCAGAC AAGTTGGTGT TCAAGACTTG 
CGATACTATT GATGACCCTG AAATGTTGGA 
GAGAATTGTT ATCCACCTAC GGTTTTGATG 
ATGGGATCTG CTTTAATGGC TTTGGAAGAC 
GGAAGCTATC TTGAAATTGT TAGATGCTGT 
CATCAAGAGA CTTGGAACAA CCATTTTTGT 
TCCATCTCCG GTAGAGGAAC TGTTGTCACT 
TTTGAAGAAG GGTGAAGAAA TCGAAATTGT 
ACAAGACTAC TGTTACCGGT ATTGAAATGT 
GCTATGGCTG GTGACAACTG TGGTGTTTTG 
TGAAATCAAG AGAGGTATGG TTTTGGCCAA 
ACAAGAAGTT CTTGGCTTCC TTGTATATTT 
CGTTCCACTC CATTTGGTGA AGGTTACAAG 
TAACGATGTC ACTACCACAT TTTCATTCCC 
ATTCTCAAAT GATCATGCCA GGTGACAACA 



ATCACCAAAG 


TTTTAGCCGA 


50 


TTCTATTGAT 


AGAGCTCCAG 


100 


CTGCCCACGT 


TGAATACGAA 


150 


TGTCCAGGAC 


ACGCTGATTA 


200 


AATGGATGGT 


GCTATCATTG 


250 


AAACCAGAGA 


ACATTTGTTA 


300 


GTTGTGTTTG 


TCAACAAAGT 


350 


ATTAGTCGAA 


ATGGAAATGA 


400 


GTGACAACAC 


TCCAGTTATT 


450 


AAGAAACCAG 


AAATTGGTAA 


500 


CGATGAACAC 


ATTCCAACTC 


550 


TACCAGTTGA 


AGACGTGTTC 


600 


GGTAGAGTTG 


AAAGAGGTGT 


650 


TGGTGGTTTT 


GACAAACCTT 


700 


TCAAAAAAGA 


ATTAGACTCT 


750 


TTAAGAGGTG 


TTAAAAGAGA 


800 


ACCAGGTACT 


GCCACTTCTC 


850 


TGACTTCCGA 


AGAAGGTGGY 


900 


CCTCAATGCT 


TCTTCAGAAC 


950 


AGAAGGAGAA 


GGTGTTGACC 


1000 


TTGAAATGGT 


TGGTGAAT 


1048 



2) INFORMATION FOR SEQ ID NO: 1562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1074 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dublinlensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1562 



TGATCACGGT AAAACCACAT TAACTGCTGC CATTACCAAA GTATTAGCTG 50 

AACAAGGTGG TGCCAACTTT TTGGATTACG GTTCCATTGA TAGAGCTCCA 10 0 

GAAGAAAGAG CCAGAGGTAT CACTATTTCC ACTGCCCACG TTGAATACGA 150 

AACCAAGAAC AGACACTATG CCCACGTTGA TTGTCCAGGA CACGCTGATT 200 

15 ATATCAAAAA CATGATTACT GGTGCTGCTC AAATGGATGG TGCTATCATT 2 50 

GTTGTTGCTG CTACTGACGG TCAAATGCCA CAAACCAGAG AACATTTATT 300 

GTTGGCAAGA CAAGTTGGTG TTCAAGACTT GGTTGTCTTT GTCAACAAAG 350 

TTGATACTAT TGATGACCCT GAGATGTTGG AATTAGTCGA AATGGAT^TG 4 00 

AGAGAATTGT TGTCCACCTA CGGTTTTGAT GGTGACAACA CTCCTGTTAT 450 

20 TATGGGATCT GCTTTAATGG CCTTGGAAGG CAAAAAACCA GAAATTGGTA 500 

AGGAAGCTAT TTTGAGATTG TTAGATGCTG TCGATGAACA CATTCCAACT 550 

CCATCAAGAG ACTTGGAACA ACCATTTTTG TTGCCAGTTG AAGACGTGTT 600 

CTCCATCTCT GGTAGAGGAA CTGTTGTCAC CGGTAGAGTT GAAAGAGGTG 650 

TCTTGAAGAA GGGTGAAGAA ATCGAAATTG TTGGTGGTTT TGACAAACCA 700 

25 TACAAGACCA CTGTTACTGG TATTGAAATG TTCAAAAAGG AATTAGATTC 750 

TGCTATAGCT GGTGACT^CT GTGGTGTTTT GTTGAGAGGT GTTAAAAGAG 800 

ATGAAATCAA GAGAGGTATG GTTTTGGCCA AGCCAGGTAC TGCTACTTCT 850 

CACAAGAAAT TTTTAGCATC TTTGTATATT TTGACTTCAG AAGAAGGTGG 90 0 

TCGTTCCACT CCATTTGGAG AAGGTTACAA GCCTCAATGT TTCTTCAGAA 950 

30 CTAATGACGT CACTACCACA TTTTCATTCC CAGAAGGAGA AGGTGTTGAC 1000 

CACTCCCAAA TGGTCATGCC AGGTGATAAC ATTGAAATGG TTGGTGAATT 1050 

GATCAAATCA TGTCCATTGG AAGT 1074 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1563 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1033 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1563 

GATCACGGGA AGACTACTTT GACCGCTGCC ATCACCAAAG TTTTAGCCGA 50 
AAAAGGTGGT GCTAACTTCT TGGACTACGG TTCTATCGAT AAAGCTCCAG 100 
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AAGAAAGAGC CAGAGGTATT ACTATTTCTG 
ACTGACAAGA GACACTATGC CCATGTTGAT 
TATCAAGAAT ATGATTACTG GTGCTGCTCA 
TTGTTGCTGC TTCCGATGGT CAAATGCCTC 
TTGGCCAGAC AAGTTGGTGT TCAACACTTG 
CGACACCATT GACGATCCAG AAATGTTGGA 
GAGATTTGTT AACTACTTAC GGTTTTGATG 
ATGGGATCTG CTTTGTGTGC TTTGGAATCC 
AAAAGCCATT GAAAAATTGT TAGATGCCGT 
CAGTCAGAGA CTTGGAACAA CCATTCTTGA 
TCCATTTCCG GTAGAGGTAC CGTTGTTGCT 
CTTGAAGAAG GGTGAAGAAA TCGAAATTGT 
TCAAGGCCAC TGTTACTGGT ATTGAAATGT 
GCTCTTGCTG GTGACAACTG TGGTATCTTG 
CGAAGTTAAG AGAGGTATGG TCTTGACCAA 
ACAAGAAGAT CTTGGCCTCG TTGTATATCT 
AGACACTCTC CATTTGGAGC CAACTACAAG 
CACCGATGTT ACCGGTACCA TGACCTTCCC 
CCATGGTCAT GCCAGGTGAC AACGTTGAAA 



PCT/CAOO/01150 

CTGCCCATGT TGAATACGAA 150 

TGTCCAGGTC ACGCAGATTA 2 00 

AATGGATGGT GCCATTATTG 250 

AAACCAGAGA ACATTTGTTA 300 

GTTGTTTTCG TCAACAAGGT 350 

ATTGGTTGAA ATGGAAATGA 400 

GTGATAACAC CCCAGTTATC 450 

AGAGAACCAG AAATTGGTCA 500 

CGATGAATAC ATTCCAACCC 550 

TGCCAGTTGA AGAAGTTTTC 600 

GGTAGAGTCG AAAGAGGTAC 650 

TGGTGGTTTC GACAAGCCAT 700 

TCAAGAAGGA ATTGGACTCC 750 

TTGAGAGGTG TCAAGAGAGA 800 

GCCAAACACC GTCACTTCCC 850 

TGACCAAGGA AGAAGGTGGT 900 

CCCCAATTGT TCATGAGAAC 950 

AGAAGGTGCC GACCAATCTG 1000 

TGC 1033 



2) INFORMATION FOR SEQ ID NO: 1564 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glahrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SE 



ID NO: 1564 






ATCACCAAGA 


CATTGGCCAA 


50 


TTCCATTGAC 


AAAGCTCCAG 


100 


CTGCCCATGT 


CGAGTACGAG 


150 


TGTCCAGGTC 


ACGCCGACTA 


200 


AATGGACGGT 


GCTATCATCG 


250 


AAACTAGAGA 


GCATTTGCTG 


300 


GTTGTCTTTG 


TCAACAAGGT 


350 


ATTAGTGGAA 


ATGGAAATGA 


400 


GTGACAATGC 


CCCTATCATT 


450 


CGTCAACCTG 


AAATTGGTGA 


500 


TGATGAATAC 


ATTCCAACCC 


550 


TGCCTGTTGA 


AGACATCTTC 


600 


GGACGTGTCG 


AAAGAGGTAA 


650 


TGGTCACAAC 


ACTACCCCAT 


700 


TCAGAAAGGA 


ATTGGACCAA 


750 
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GCTATGGCTG GTGACAACGC 
CCAATTGAAG AGAGGTATGG 
ACACCAAGAT TTTGGCTTCT 
AGACATTCTG GTTTCGGTGA 
CGCAGATGTC ACTGTTGTGA 
CTATGCAAGT TATGCCAGGT 
CACCCA 



CGGTATCCTA TTGAGAGGTA 
TCATGGCCAA GCCAGGTACC 
TTGTACATCT TGTCTAAGGA 
AAACTACAGA CCTCAGATGT 
TG7AGTTCCC AGAATCTGTG 
GACAACGTCG AAATGGTCTG 



TCAGAAGAGA 800 

GTCAAGGCTC 850 

AGAAGGTGGT 900 

TTATCAGAAC 950 

GAAGACCACT 10 00 

TGAACTAGTC 1050 
1056 



2) INFORMATION FOR SEQ ID NO: 1565 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1565 

GATCATGGTA AGACCACTTT GACCGCTGCT ATCACCAAGG TTTTGTCGGA 50 

AAAAGGAGGT GCTAATTTCT TGGATTACGG CTCCATCGAC AGAGCTCCAG 100 

AAGAGAGAGC CAGAGGTATC ACCATTTCCA CTGCCCATGT TGAGTACCAA 150 

ACTGATAAGA GACATTATGC CCACGTTGAC TGTCCAGGTC ACGCCGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCCATTATTG 250 

TTGTTGCTGC CACTGACGGT CAAATGCCTC AGACCAGAGA GCACTTGTTG 300 

TTGGCCAGAC AAGTTGGTGT GCAACACTTG GTAGTTTTTG TGAACAAGGT 350 

GGACACCATT GACGATCCCG AGATGTTGGA ATTGGTCGAG ATGGAAATGA 400 

GAGAATTGTT GAGTCAGTAC GGTTTCGATG GTGACAACAC CCCAGTTATC 450 

ATGGGATCTG CTTTGTGTGC TTTGGAAAGT AAGCAGCCAG AAATTGGTGT 500 

GCAAGCCATT GAAAAATTGT TGGACGCTGT CGATGAGCAC ATTCCTACTC 550 

CTACCCGTGA CTTGGAACAG CCATTCTTGT TGCCTGTTGA AGATGTGTTC 600 

TCCATTTCTG GTAGAGGAAC TGTGGTTACT GGTAGAGTCG AAAGAGGTTC 650 

GTTGAAGAAG GGTGAGGAAA TCGAGATTGT TGGTGACTTT GACAAGCCAT 700 

TCAAGACCAC TGTGACTGGA ATTGAAATGT TCAAGAAGGA ATTGGATGCT 750 

GCTATGGCTG GTGACAATGC TGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

CGATGTCAAG AGAGGTATGG TTTTGGCCAA GCCTTCCACC GTCACTTCTC 850 

ACAAGAAGGT GTTGGCTTCC TTGTACATCT TGAGTAAGGA AGAAGGTGGC 900 

CGTCACTCTC CTTTTGGTGA GAACTACAAG CCTCAATTGT TCATCAGAAC 950 

TACTGACGTT ACCGGTACTT TAAGATTCCC AGCCGGCGAG GGTGTCGACC 1000 

ACTCGCA7VAT GGTTATGCCA GGTGACAATG TTGAGATGGA AATTGAGCTT 1050 

GTGAGAAAGA C ^^^^ 



2)INFOE^TION FOR SEQ ID NO: 1566 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1566 

GATCACGGTA AGACTACCTT GACTGCTGCT ATCACCAAGG TTTTGGCTTC 50 

TAAGGGTGGT GCTAGCTTCT TGGACTATGG TTCCATTGAC AGAGCCCCAG 100 

AGGAGAGAGC TAGAGGTATT ACTATTTCTA CTGCCCACGT TGAGTACCAA 150 

ACCGAAAAGA GACACTACGC CCACGTCGAC TGTCCAGGTC ACGCTGATTA 2 00 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCTATCATTG 2 50 

TTGTTGCTGC TTCTGATGGT CAGATGCCTC AGACCAGAGA GCACCTTTTG 300 

TTGGCCAGAC AGGTTGGTGT TCAGAACTTG GTTGTTTTCG TTAACAAGGT 350 

TGACACCATT GACGACCCTG AAATGTTGGA ATTGGTTGAG ATGGAAATGA 400 

GAGAATTGTT GACTACTTAC GGTTTTGACG GTGATGAGAC TCCTGTTATC 450 

ATGGGTTCTG CTTTGTGCGC TTTGGAAGAG AAGCAACCAG AGATTGGTGA 500 

GCAGGCTATC ATGAAGTTGT TGGACGCTGT CGATGAGTAC ATTCCAACCC 550 

CACAGCGTGA CTTGGAGCAG CCATTCTTGA TGCCTGTTGA GGATGTTTTC 600 

TCCATTTCTG GTAGAGGTAC TGTCGTTACT GGTAGAGTTG AGAGAGGTTC 650 

TTTGAAGAAG GGTGAGGAAA TCGAGATTGT CGGTGACTTC GCCAAGACTT 70 0 

TCAAGGCTAC CGTTACTGGT ATTGAGATGT TCAAGAAGGA ATTGGATGCT 750 

GCTATGGCTG GTGACAACGC CGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

TGAGATCTCC CGTGGTGATG TCTTGGCCAA GCCAGGTACT GTTACTCCAC 850 

ACAAGAAGAT CTTGGCTTCT TTGTACGTTT TGACCAAGGA AGAAGGTGGT 90 0 

CGTCACAACC CATTCGCTGA GAACTACAAG CCACAGTTGT TCCTCAGAAC 950 

CACCAACGTC ACTGGTACCA TGAGATTCCC AGAAGGTGAA GATGTTGACC 1000 

ACTCTGCCAT GGTTAACCCA GGTGACAACG TTGAGATGGA AATCGAGTTG 1050 

GGTAGAAAGG CCCCACTTGA GTT 1073 



2) INFORMATION FOR SEQ ID NO: 1567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1567 

CATGGTAAGA CCACTTTGAC TGCTGCCATC ACCAAGACTC TAGCTGAACG 50 

TGGTGGTGCT GACTTTTTGG ACTACTCTTC TATTGACAAG GCTCCAGAAG 100 

5 AAAGAGCYAG AGGTATCACT ATTTCTACTG CTCATGTTGA ATACGAGACT 150 

GAAAAGAGAC ATTACTCCCA CGTTGACTGT CCAGGTCACG CTGATTACAT 20 0 

CAAGT^CATG ATTACTGGTG CTGCTCAAAT GGACGGTGCT ATTATTGTTG 250 

TTGCTGCTAC TGATGGTCAA ATGCCTCAAA CCAGAGAGCA TTTGTTGTTG 300 

GCCAGACAAG TTGGTGTCCA ACACATTGTT GTTTTCGTTA ACAAGGTTGA 350 

10 CACCATCGAT GATCCAGAJ\A TGTTGGAATT GGTTGAAATG GAAATGAGAG 400 

AATTRTTGAC TCAATATGGC TTTGACGGTG ACAACACTCC AGTGATCATG 450 

GGTTCTGCTT TGTGTGCCTT GGAAGGTAAG CAACCAGAAA TTGGTGAGCA 500 

AGCCATCATG AAGTTGTTGG ACGCTGTTGA CGAATACATC CCAACCCCAG 550 

CCCGTGACTT GGAAAAVCCA TTCTTGATGC CTGTTGAAGA TATCTTCTCC 600 

15 ATTTCCGGTA GAGGTACTGT CGTCACTGGT AGAGTTGAAC GTGGTAACTT 650 

GAAGAAGGGT GAAGAAATCG AAATTGTTGG TCACAACACC ACTCCTTTCA 700 

AGACTACTGT TACTGGTATT GAAATGTTCA GAAAGGAATT GGACCAAGCC 750 

ATGGCTGGTG ACAACGCTGG TGTCCTTTTG AGAGGTGTCA GAAGAGACCA 800 

ATTGAAGAGA GGTATGGTTT TGGCTAAGCC AGGTACTGTT AAGGCCCACA 850 

20 CCAAGTTCTT GGCTTCCTTG TACATTTTGA CCAAGGAAGA AGGTGGTAGA 900 

CACTCCGGTT TCGGTGAAT^ CTACAGACCA CAAATCTACG TCAGAACTGC 950 

TGACGTTACC GTDGTCTTGA AGTTCCCAGA ATCTGTTGAA GACCATTCCA 1000 

TGCAAGTCAT GCCAGGTGAC AATGTCGAAA TGGAGTGTGA ATTGGTTCAC 1050 

CCAACTCCAT TG 1062 

25 



2) INFORMATION FOR SEQ ID NO: 1568 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 
40 (B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1568 

CGGAAAGACC ACCTTGACCG CCGCCATTAC CAAGGTTTTG GCTGACAAGG 50 

45 GTGGCGCCAA CTTCTTGGAC TACGGTGCCA TTGACAAGGC TCCTGAAGAA 10 0 

AGAGCACGTG GTATCACCAT TTCCACTGCC CACGTTGAAT ACGAGACCGA 150 

CAACAGACAC TACGCCCACG TTGACTGTCC AGGTCACGCC GATTACATCA 2 00 

AGAACATGAT CACGGGTGCC GCTCAAATGG ACGGTGCCAT TATTGTTGTT 2 50 

GCAGCCACCG ACGGCCAAAT GCCTCAAACC AGAGAGCACT TGTTGTTGGC 3 00 

50 CAGACAAGTT GGTGTGCAAC ACTTGGTTGT TTTCGTGAAC AAGGTTGATA 350 

CCATCGACGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGAGAA 4 00 

TTGTTGACTC AATACGGATT TGACGGCGAT GAAACCCCTG TTGTTATGGG 4 50 

CTCTGCTCTT TGCGCTTTGG AAGGTAGAGA ACCAGAGATT GGTGAGCAAG 500 
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CCATCACCAA 
CGTGACTTGG 
TTCTGGTAGA 
AGAAGGGTGA 
ACTACTGTTA 
GGCTGGTGAC 
TTTCCCGTGG 
AAGGTTTTGG 
CTCTCCATTT 
ATGTCACTGG 
GCTATGGTTT 
AAAGACTCCT 



GTTGTTGGAG 
AACAACCATT 
GGTACTGTTG 
GGAGATCGAG 
CTGGTATTGA 
AATGCTGGTA 
TATGGTTTTG 
CTTCTTTGTA 
GGCGAGAACT 
TACTTTGAGA 
CTCCAGGTGA 
CT 



GCTGTTGACG AGTACATCCC AACCCCACAA 
CTTGATGCCT GTTGAAGATG TTTTCTCCAT 
TCACTGGTAG AGTGGAGAGA GGTTCCTTGA 
ATTGTTGGTG ACTTTGACAA GCCTTTCAAG 
GATGTTCAAG AAGGAATTGG ACGCTGCTAT 
TCTTGTTGAG AGGTGTCAAG AGAGAACAAG 
GCCAAGCCAG GCACTGTGAC CTCGCACAAG 
CATTTTGTCT AAGGAAGAAG GTGGTCGTCA 
ACAAGCCTCA ATTGTTCCTT AGAACTACCG 
TTCCCAGCAG GTGAGGACGT TGACCACTCC 
CAATGTCGAG ATGGAAATCG AGTTGGTCAG 



550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1062 



2) INFORMATION FOR SEQ ID NO: 1569 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2504 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1569 

TCACCAAGAC TTTGGCTGAA CGTGGTGGTG CTGATTTCTT GGACTACTCT 50 

TCCATTGACA AGGCTCCAGA AGAAAGAGCR AGAGGTATCA CTATTTCTAC 100 

TGCACATGTT GAATATGAAA CTGACAAGAG ACATTACTCT CACGTCGACT 150 

GTCCAGGTCA TGCTGATTAC ATCAAGT^TA TGATTACTGG TGCTGCCCAA 2 00 

ATGGATGGTG CTATCATTGT TGTTGCTGCT ACAGATGGTC AAATGCCTCA 2 50 

AACCAGAGAA CATTTGTTGT TGGCTAGACA AGTTGGTGTT CAACAYATCG 300 

TTGTTTTCGT TAACAAGGTT GACACTATCG ATGACCCTGA AATGTTAGAA 350 

TTGGTTGAAA TGGAAATGAG AGAATTATTG ACCCAATACG GTTTCGATGG 4 00 

TGACAACACT CCAGTCATCA TGGGTTCTGC TTTGTGTGCT TTAGAAGGTA 4 50 

AGCAACCAGA AATTGGTGAA CAAGCAATCA TGAAGTTATT GGACGCTGTT 500 

GACGAATACA TCCCAACTCC AGCTCGTGAT TT6GAAAAGC CTTTCTTGAT 550 

GCCTGTTGAA GATATCTTCT CCATCTCCGG TAGAGGTACC GTCGTAACTG 600 

GTAGAGTTGA ACGTGGTAAC TTGAAKAAGG GTRAAGAAAT CGAAATCGTT 650 

GGTCACAACA CCACTCCATT CAAGACCACT GTTACTGGTA TTGAAATGTT 700 

CAGAAAGGAA TTGGACCAAG CTATGGCTGG TGATAACGCT GGTGTCWTGT 750 

TGAGAGGTGT CAGAAGAGAC CAATTAAAGA GAGGTATGGT CTTGGCCAAG 800 

CCAGGTACTG TCAAGGCTCA CACCRAATTC TTGGCCTCTT TGTATATCTT 850 

GACCAAGGAA GAAGGTGGTA GACATTCCGG TTTCGGTGAA AATTACAGAC 900 

CTCAAATCTA CGTTAGAACT GCTGATGTCA CCGTTGTTTT GAAGTTCCCA 950 

GAAGCTGTTG AAGATCACTC TATGCAAGTC ATGCCAGGTG 990 
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2) INF0Rb4ATI0N FOR SEQ ID NO: 1570 



5 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1184 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



10 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 7 50 



15 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1570 



GATCATGGTA AAACCACTTT GACTGCTGCC ATTACTAAAG TCTTGGCTGA 
TAAAGGTCAA GCTAACTTCT TAGATTACGG TTCTATTGAT AGAGCTCCAG 
AAGAAAGAGC CAGAGGTATC ACCATCTCTA CTGCCCACGT TGAATATGAA 

20 ACCGAAAAAA GACATTACGC CCATGTTGAG TATGTATACT TTTTTTTGTT 
GT6TAATTGT TTTAAAGATT TTCTTTAAAG CTGAAGAAGT CA7VATCAGTT 
CTTTGATAAC TTCTATTAAA AAAAGGGAAA AATTAACAAG ATATACTAAC 
ACTATAACAG TTGTCCTGGA CATCAAGATT ATATCAAGAA TATGATTACC 
GGTGCCGCTC AAATGGATGG TGCTATTATT GTTGTTGCTG CCACTGATGG 

25 TCAAATGCCA CAAACCAGAG AACATTTGTT GTTGGCTAGA CAAGTCGGTG 
TTCAAGATTT GGTTGTCTTT GTTAACAAAG TCGACACTAT TGATGACCCA 
GAAATGTTGG AATTGGTTGA AATGGAAATG AGAGAATTAT TGACTACTTA 
CGGTTTTGAT GGTGATAACA CTCCTGTTAT CATGGGTTCT GCTTTGATGG 
CCTTGCAAGG TAAACAACCA GAAATTGGTG AACAAGCTAT CATGAAATTG 

30 ATGGACGCTA TTGATGAACA CATTCCAACC CCAACCAGAG ACTTGGAACA 
ATCTTTCTTG ATGCCAGTTG AAGATGTTTT CTCCATTTCT GGTAGAGGTA 
CTGTTGTTAC TGGTAGAGTC GAAAGAGGTG TCTTAAAGAA GGGTGAAGAA 
ATTGAAATTG TTGGTGGTTT CGAAAAACCA TTCAAGACCA CTGTTACTGG 
TATTGAAATG TTCAAGAAAG AATTAGATGC TGCTATGGCT GGTGACAACT 

35 GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG ACGAAATCAA GAGAGGTATG 
GTTTTGGCTA AACCAGGTAC TGCTACTTCC CACAAGAAAT TCTTGGCTTC 
CATGTATATC TTAACTGCTG AAGAAGGTGG TCGTTCCACT CCATTCGGTG 
AAGGTTACAA GCCACAATGT TTCTTCAGAA CTAACGATGT TACCACTTCC 
TTCTCTTTCC CAGAAGGTGA AGGTGTTGAC CACTCCCAAA TGGTTATGCC 

40 AGGTGACAAC ATTGAAATGG TCGGTGAATT GATT 



2) INFORMATION FOR SEQ ID NO: 1571 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
ilOO 
1150 
1184 



45 



50 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1571 

CGATCACGGT AAGACCACCT TGACCGCCGC CATCACCAAG GTCTTGGCCG 50 

ACAAGGGTCA GGCTAACTTC TTGGACTACG GATCCATTGA CAGAGCCCCT 100 

GAAGAAAGAG CAAGAGGTAT CACTATCTCC ACTGCCCACG TTGAATACGA 150 

10 GACTGATAAG AGACACTATG CCCACGTTGA TTGCCCGGGC CATCAAGATT 200 

ATATCAAGAA TATGATCACT GGTGCTGCCC AAATGGACGG TGCTATCATT 250 

GTTGTTGCTG CTACTGACGG TCAGATGCCA CAAACCAGAG AACACTTGTT 300 

GTTGGCTAGA CAAGTTGGTG TCCAAGACTT GGTTGTTTTC GTTAACAAGG 350 

TTGACACTAT CGATGACCCA GAAATGTTGG AATTGGTTGA AATGGAAATG 400 

15 AGAGAATTAT TATCTTCTTA YGGCTTTGAC GGTGACAACA CCCCAGTTGT 450 

CATGGGTTCC GCTTTGATGG CTTTGCAAGG TAAGCAACCA GAAATTGGTG 500 

AACAAGCTAT TATTAAGTTG ATGGACGCTA TTGATGAACA CATYCCAACC 550 

CCAACCAGAG ACTTGGAACA ACCATTCTTG TTGCCAGTTG AAGATGTCTT 600 

TTCTATTTCC GGTAGAGGTA CCGTCGTCAC TGGTAGAGTC GAAAGAGGTG 650 

20 TCTTGAAGAA GGGTGAAGAA ATTGAAATTG TCGGTAACTT TGAAAAGCCA 700 

TTCAAGACCA CCGTTACTGG TATTGAAATG TTCAAGAAGG AATTGGATGC 750 

TGCTATGGCT GGTGACAACT GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG 800 

ACGAAATCAG CAGAGGTATG GTTTTGGCCA AGCCAGGTAC CGTCACTTCC 850 

CACAAGAAGT TCTTGGCCTC CATGTACATC TTGACTGGTG AAGAAGGTGG 900 

25 TCGTCGTACC CCATTCGGTG AAGGTTACAA GCCACAATGT TTCTTCAGAA 950 

CCAATGACAT CACCACCACT TTCACTTTCC CAGAAGGTGA AGGTGTCGAC 1000 

CACTCCCAAA TGGTTATGCC AGGTGACAAC ATCGAAATGG TTGGTGAATT 1050 

GTACAAGGCT TGTCCTTGGA A 1071 



30 



45 



2) INFORMATION FOR SEQ ID NO: 1572 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1572 



TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 50 

ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 100 

50 AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 150 

GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 200 

CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 250 

CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTG6 ACAACTACAT 300 
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TCCTACGCCT GAGCGTGCCG TTGACGGTAC 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA 
5 ACCAGGGCGA AGCTGGCGAT AACGTCGGTC 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG 
GCCACACACT GACTTCGACG CCGAGGTGTA 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT 
CGTACAACTG ACGTGACCGG CACCATCGAG 
10 GGTTCTGCCA GGCGACAACA TTTCGATGAA 
TCGCCATGGA AGAAGGT 



PCT/CAOO/01150 

GTTCCTGATG CCTGTTGAAG 3 50 

TTGTGACCGG TCGTATTGAG 4 00 

GAAATCGTGG GTATCAAAGA 4 50 

AATGTTCCGC AAACTGCTGG 500 

TGCTGCTGCG TGGTACCAAG 550 

GCCAAGCCAG GCTCGATCAA 600 

CATTCTGTCC AAAGAAGAAG 650 

ACCGTCCTCA GTTCTACTTC 700 

CTGCCAGAAG ACAAGGAAAT 7 50 

AGTGTCCCTG ATCGCTCCTA 800 

817 



15 2) INFORMATION FOR SEQ ID NO: 1573 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevoteiia buccalis 

(B) STRAIN: ATCC 35310 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 3 



TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 2 00 

35 TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 300 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 350 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 4 00 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 

40 GTCTGTTGTA ACAGGCGTTG AAATGTTCCG TAAGATTCTT TCTGAAGGTG 500 

AAGCAGGTGA TAACGTAGGA CTTTTGCTCC GCGGTATCGA TAAGGATGAA 550 

GTAAAGCGTG GTATGGTTGT TGTACACCCA GGTGCCATCA CTCCTCACGA 600 

TCACTTCAAG GCTTCCATCT ATGTATTGAA GAAGGAAGAG GGTGGACGTC 650 

ATACTCCATT CGGAAACAAG TATCGTCCTC AGTTCTATCT CCGTACAATG 700 

45 GACTGTACAG GTGAGATCAC TTTGCCAGAA GGCGTAGAGA TGGTGATGCC 750 

TGGTGACAAC GTAGAGATTG AGGTTACCTT GATTTACAAG GTTGCC 796 



50 2) INFORMATION FOR SEQ ID NO: 1574 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 bases 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succini\ribrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1574 



GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

15 TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGJ\AGATAT 350 

20 CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 400 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

25 CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 
ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC * 800 

TATGGCTGAA GGCGAGAGAT 820 

30 



2) INFORMATION FOR SEQ ID NO: 1575 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: TetragGnococcus halophilus 
45 (B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1575 

GGTAGTGTCT GCAGCTGACG GCCCAATGCC ACAAACTCGT GAACATATCC 50 

50 TATTGTCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

ATGGATATGG TTGACGATGA AGAATTACTT GAATTAGTTG AAATGGAAGT 150 

ACGTGATCTA TTAACTGAAT ACGACTTCCC AGGTGATGAT ACTCCTGTTA 2 00 

TTTCTGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA ATATGAACAA 250 
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AAAGTCTTAG 
ACGTGACCAT 
TCACTGGTCG 
AAAGTCGGTG 
GACAACTGTT 
AAGCTGGCGA 
ATCGAACGTG 
AAAATTCTCA 
ATACTCCATT 
GACATCACTG 
AGGTGATAAC 
AAA 



ACTTAATGGC 
GACAAACCGT 
TGGAACTGTT 
ATGAAGTTGA 
ACGGGTGTAG 
TAACATCGGT 
GTCAAGTGTT 
GCTGAAGTTT 
CTTCTCAAAC 
GTGTCATTGA 
GTAGCAATGG 



AGCTGTAGAT 
TCATGATGCC 
GCTACAGGTC 
TATCATCGGT 
AAATGTTCCG 
ACATTATTGC 
GGCTAAACCA 
ATGTATTAAC 
TATCGCCCAC 
ATTGCCAGAA 
AAGTTGAATT 



GATTTCATCC CAACTCCTGA 
GATTGAAGAT GTTTTCTCAA 
GTGTTGAACG TGGAACTATT 
ATTCATGAAA ATGTTAAAAA 
TAAATTGTTG GATTACGCTG 
GTGGTGTTTC TCGTGATGAT 
GGCACAATCA CACCACATAC 
AAAAGAAGAA GGCGGACGTC 
AATTTTACTT CCGTACAACT 
GGTACTGAAA TGGTTATGCC 
AATTCACCCT GTTGCTATTG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
803 



2) INFORMATION FOR SEQ ID NO: 1576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1576 

TTAATGAAGC AATTGTTGTA AATTTTGAAA GCGAAGGAAA AAAACATAAA 50 

CTTGTTTTAG AAGTAGCAGC TCATTTAGGA GATAATAGAG TTAGAACTAT 100 

TGCTATGGAT ATGACAGATG GTTTGGTAAG GGGCTTAAAA GCTGAGGCTT 150 

TAGGTGCTCC TATTAGTGTT CCTGTTGGTG AGAAAGTTTT AGGAAGAATT 200 

TTCAATGTTA CTGGAGATTT GATCGATGAA GGTGAAGAAA TTTCTTTTGA 250 

TAAAAAATGG GCAATTCATA GAGATCCGCC AGCTTTTGAA GATCAAAGCA 300 

CAAAAAGTGA GATTTTTGAA ACAGGGATTA AAGTTGTAGA TTTGCTTGCT 350 

CCTTATGCAA AAGGTGGTAA AGTAGGTCTT TTTGGTGGTG CAGGTGTTGG 400 

TAAAACTGTT ATTATTATGG AGCTTATTCA CAATGTTGCA TTTAAGCATA 4 50 

GCGGCTATTC TGTATTTGCA GGTGTGGGTG AGAGAACTCG TGAAGGAAAT 500 

GACCTTTATA ATGAAATGAA AGAAAGTAAT GTTTTAGACA AAGTTGCTCT 550 

ATGTTATGGA CAAATGAATG AACCACCAGG AGCAAGAAAT CGTATTGCTT 600 

TAACAGGTTT AACAATGGCT GAGTATTTTA GAGATGAAAT GGGTCTTGAT 650 

GTGCTTATGT TTATTGATAA TATCTTTAGA TTTTCACAAT CAGGTTCTGA 700 

AATGTCAGCA CTTTTAGGAA GAATTCCATC AGCTGTGGGT TATCAACCAA 750 

CCCTAGCAAG TGT^AATGGGT AAATTCCAAG AAAGAATTAC TTCAACTAAA 800 
AAAGG 



2) INFOE^^TION FOR SEQ ID NO: 1577 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Campylobacter rectus 

(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1577 

15 ATTTACCCAA GATCAACGAA GCCGTCGAAG TAAATTTCGA GGTTGAGGGC 50 

AAGCAAAACA GACTTGTGTT AGAGGTTGCC GCACACCTTG GCGATAACCG 100 

CGTAAGAACG ATCGCTATGG ATATGAGCGA AGGCTTAACC AGAGGCCTTG 150 

AGGCTACGGC TCTTGGCGCG CCTATTAGCG TTCCGGTCGG CGAAAAAGTT 200 

TTGGGAAGGA TTTTTAACGT CGTCGGCGAT CTGATCGACG AGGGTGAAGG 250 

2 0 CATAGAATTT GATAAAAAAT GGTCTATCCA CCGCGATCCT CCGCCGTTTG 300 

AAGAGCAAAG CACGAAGAGT GAAATTTTTG AAACGGGTAT AAAAGTGGTC 350 

GATCTTCTAG CCCCTTATGC AAAAGGCGGT AAGGTCGGAT TATTCGGCGG 400 

TGCCGGCGTC GGTAAGACGG TTATTATCAT GGAGCTTATC CACAACGTTG 450 

CATTTAAGCA TAGCGGTTAT TCCGTGTTTG CCGGCGTGGG CGAGCGAACC 500 

25 CGCGAAGGAA ACGACCTTTA TCACGAGATG AAAGAGAGTA ACGTTTTGGA 550 

CAAAGTCGCC TTGTGCTACG GCCAGATGAA CGAGCCGCCG GGAGCAAGAA 600 

ACCGCATCGC TCTAACAGGC CTAACGATGG CTGAATACTT CCGCGACGAG 650 

ATGGGACTTG ACGTTTTGAT GTTTATAGAC AACATCTTCC GTTTCTCTCA 700 

GTCTGGCGCT GAGATGTCGG CGCTTCTTGG ACGTATCCCG TCAGCCGTTG 7 50 

30 GTTATCAGCC GACTTTGGCG AGCGAAATGG GCAAATTCCA AGAGAGAATT 800 

ACATCAACC 809 



35 2) INFORMATION FOR SEQ ID NO: 1578 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157 8 

AATCGAAGTG CAACGTTCAT TGCGTGTACT TGATGGTGCT GTAACTGTGT 50 
TGGACTCACA ATCAGGTGTA GAACCTCAAA CAGAAACAGT TTGGCGTCAA 100 
GCGACAGATT ACCGCGTACC GCGTGTTGTA TTCTGTAACA AAATGGACAA 150 
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AATTGGTGCA GACTTCTTAT ACTCTGTATC AACTTTACAT GATCGTTTAC 200 

AAGCAAATGC TCACCCAATC CAATTACCAA TTGGTGCGGA AGATGACTTT 250 

ACTGGTATTA TCGACTTAGT AAAAATGAAA GCTGAAATCT ACACAAATGA 300 

CTTAGGAACT GAAATCC/^G AGACTGAAAT TCCTGAAGAA TACGTAG7\AT 350 

5 TAGCTGAAGA ATGGCGCGAA AAATTAATTG AAGCTGTTGC TGATACTGAT 4 00 

GAAGAACTAA TGATGAAATT CTTGGAAGGT GAAGAAATCA CTGAAGAAGA 4 50 

ATTGAAAGCT GGTATTCGTC AAGCAACATT GACTGTTGAC TTTTTCCCTG 500 

TTCTTTGCGG ATCTGCCTTT AAAAACAAAG GGGTTCAATT GATGTTGGAT 550 

GCAGTCATCG ACTACTTGCC TTCACCACTT GATGTTCCTG CGATTAAAGG 600 

10 GATCAATCCT AAAACAGACG AAGAAACTGA TCGTCCGGCT GACGATGAAG 650 

CACCATTTGC TTCATTAGCA TTTAAAGTAA TGACTGACCC ATTCGTARGT 7 00 

CGTTTGACAT TCTTYCGTGT GTATTCARGT ATCTTGAACT CTGGATCATA 750 

CGTATTGAAT GCTTCAAAAG GCAAACGCGA ACGTATCGGT CGGATCCTAC 800 

AAATGCACGC CAACACTCGT GCTGAAATCC AAACAGTATA CTCAGGCGAT 850 

15 ATCGCCGCTG CTGTTGGTTT GAAAGACACA ACAACTGGGG ATCCACTATG 900 

TGATGAAAAA TCCCCAGTAA TCCTTGAATC AATCGAATTC CCAGAACCAG 950 

TTATCGAAGT CGCTGTTGAG CTTAAATCAA AAGCTGACCA AGATAAAATG 1000 

GGGGTTGCTT TACAAAAACT TGCTGAAGT^ GATCCATCAT TCCGTGTGGA 1050 

AMCAAACGCT GAAACAGGCG AAACTGTTAT CGCTGGTATG GGAGAACTTC 1100 

20 ACTTGGACGT CTTAGTTGAC CGTATGCGTC GCGAATTTAA AGTTGAAGCA 1150 

AACGTAGGTG CGCCTCAAGT TTCTTATCGT GAAACATTCC GTGCAGCAAC 1200 

ACAAGCGGAA GGTAAATTTG TACGTCAGTC TGGTGGTAAA GGTCAATACG 1250 

GTCACGTATG GGTCGAATTT ACACCAAACG AAGAAGGTAA AGGCTTCGAA 1300 

TTCGAAAACG CGATTGTCGG TGGTGTGGTT CCTCGTGAAT ACATCCCAGC 1350 

25 AGTTGAAAAA GGACTTGAAG AATCAATGGC GAACGGTGTC TTAGCCGGTT 14 00 

ACCCATTAGT AGACATCAAA GCAAAACTTT ATGATGGTTC ATACCATGAT 1450 

GTCGATTCAA GTGAAACTGC CTTCCGTGTT GCAGCTTCTA TGGCTTTACG 1500 

TGCTGCAGCG AAGAAAGCAA ACCCAGTAAT TCTTGAACCA ATGATGAAAG 1550 

TAGTTATCAC TGTACCAGAA GATTACTTAG GTGATGTTAT GGGTCACGTA 1600 

3 0 ACTGCTCGTC GTGGACGCGT AGAAGGAATG GAAGCACWCG GTAACTCACA 1650 

AATCGTGAAC GCAATCGTGC C 1671 



35 2) INFORMATION FOR SEQ ID NO: 1579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1579 

GAAGTACAAC GTTCACTACG GGTTCTTGAC GGWGCTGTAA CAGTATTGGA 50 
CTCACAATCT GGTGTAGAAC CACAAACTGA AACAGTTTGG CGTCAAGCGA 100 
CAGATTACCG CGTACCACGT ATCGTATTCT GTAACAAAAT GGATAAAATC 150 
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GGTGCAGACT TCTTATACTC TGTATCTACT 
AAATGCTCAT CCAATCCAAT TACCAATTGG 
GTATCATCGA TCTAGTAAAA ATGAAAGCTG 
GGAACAGAGA TTCAAGAAAC TGAAATTCCT 
TGAAGAATGG CGCGAAAAAT TAGTTGAAGC 
AGCTAATGAT GAAATTCTTG GAAGGTGAAG 
AAAGCTGGTA TCCGTCAAGC GACATTGACT 
TTGTGGTTCA GCCTTCAAAA ACAAAGGGGT 
TCATCGACTA CCTTCCTTCA CCACTTGATG 
AATCCTAAAA CTGACGAAGA AACTGATCGT 
TTTTGCTTCA CTAGCATTTA AAGTAATGAC 
TGACATTCTT CCGTGTGTAT TCAGGTGTCT 
TTGAATGCTT CAAAAGACAA ACGCGAACGT 
GCACGCGAAC ACTTGTGCAG AAATCCAAAC 
CTGCAGCTGT TGGTTTGAAA GATTCCACAA 
AAAAATCACC CAGTAATCCT TGAATCAATC 
CGAAGTAGCT GTTGAACYTA AATCAAAAGC 
TGGCTTTACA AAAACTTGCT GAAGAAGATC 
AACGCTGAAA CTGGCGAAAC TGTTATCGCA 
GGACGTCCTT GTTGACCGTA TGCGTCGCGA 
TTGGTGCGCC TCAAGTTTCT TACCGCGAAA 
GCCGAAGGTA AATTTGTACG TCAGTCTGGT 
CGTATGGATC GAATTTACAC CAAACGAAGA 
AAAACGCAAT TGTCGGTGGT GTGGTTCCAC 
GAAAAAGGAC TTGAAGACTC 7UVTGGCTAAC 
ATTGGTTGAC ATCAAAGCCA AGCTTTACGA 
ATTCAAGTGA AACAGCCTTC CGTGTGGCAG 
GCAGCGAAGA AAGCTAATCC AGTGATTCTT 
TATCACTGTT CCTGAAGATT ACTTAGGTGA 
CTCGTCGTGG ACGTGTTGAA GGTATGGAAG 
GTTAACGCGA TT 



PCT/CAOO/01150 

TTACATGATC GCTTGCAAGC 200 

TGCGGAAGAT GACTTTACTG 250 

AGATCTATAC AAACGATTTA 300 

GAAGAGTACA AAGAATTAGC 350 

TGTTGCAGAT ACTGACGAAG 400 

AAATCACTGA AGAAGAATTG 450 

GTTGAATTTT TCCCAGTTCT 500 

TCAATTGATG TTGGATGCAG 550 

TTCCTGCAAT CA7\AGGGATC 600 

CCTGCTGACG ATGAAGCGCC 650 

TGACCCATTC GTAGGTCGTT 700 

TGAACTCTGG ATCATATGTC 750 

ATCGGTCGTA TTCTGCAAAT 800 

AGTTTATTCA GGCGATATCG 850 

CAGGGGATAC ATTGTGTGCG 900 

GAATTCCCAG AMCCAGTTAT 950 

TGACCAAGAT AAAATGGGTG 1000 

CTTCATTCCG TGTAGAAMCA 1050 

GGGATGGGTG AACTTCACTT 1100 

ATTTAAAGTT GAAGCAAACG 1150 

CATTCCGTGC TTCTACGCAA 1200 

GGTAAAGGTC AATACGGTCA 1250 

AGGTAAAGGC TTCGAATTCG 1300 
GTGAATACAT CCCAGCAGTT " 1350 

GGTGTTCTAG CTGGTTATCC 1400 

TGGTTCATAC CATGATGTCG 1450 

CTTCAATGGC TTTACGTGCT 1500 

GAACCAATGA TGAAAGTTGT 1550 

TATTATGGGA CACGTAACTG 1600 

CGCATGGTAA CTCACAAATC 1650 

1662 



2) INFORMATION FOR SEQ ID NO: 1580 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 4 9456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1580 

CAATCGAAGT ACAACGTTCT CTTCGTGTAT TGGATGGTGC TGTTACCGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TATGGAGTTC CACGTATCGT ATTTGCCAAC T^AAATGGACA 150 

819 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



AAATCGGTGC TGACTTCCTT TACTCTGTAA GCACACTTCA CGATCGTCTT 
CAAGCAAATG CACACCCAAT CCAATTGCCA ATCGGTTCTG AAGATGACTT 
CCGTGGTATC ATCGACTTGA TCAAGATGAA AGCTGAAATC TATACTAACG 
ACCTTGGTAC AGATATCCTT GAAGAAGACA TCCCAGCTGA ATACCTTGAC 
5 CAAGCTCAAG AATACCGTGA AAAATTGATC GAAGCAGTTG CTGAAACTGA 
CGAAGAATTG ATGATGAAAT ACCTCGAAGG TGAAGAAATC ACTAACGAAG 
AATTGAAAGC TGGTATCCGT AAAGCGACTA TCAACGTTGA ATTCTTCCCA 
GTATTGTGTG GCTCTGCCTT CAAGAACAAA GGTGTTCAAT TGATGCTTGA 
TGCGGTTATT GACTACCTTC CAAGCCCACT TGACATCCCA GCGATCAAAG 

10 GTATCAACCC AGATWCAGAT GAAGAAGAAA CTCGTCCAGC ATCTGATGAA 
GAGCCATTTG CAGCTCTTGC CTTCAAGATC ATGACAGACC CATTCGTAAG 
TCGTTTGACA TTCTTCCGTG TTTACTCARG TGKTCTTCAA TCAGGTTCAT 
ACGTATTGAA CACTTCTAAA GGKAT^CGTG AGCGTATCGG ACGTATCCTT 
CAAATGCACG CCAACAGCCG TCAAGAAATT GACACTGGTT ACTCAAGAGA 

15 TATCGCTGCT GCCGTTGGTT TGAAAGATAC TCCAACTGGT GACTCATTGC 
CAGATGAAAA AGCTAAAATC ATTCTTGAGT CAATCAACGT TCCAGAMCCA 
GTTATCCAAT TGATGGTTGA GCCAAAATCT AAAGCTGACC AAGCCAAGAT 
GGGTATCGCC CTTCAAAAAT TGGCTGAAGA AGATCCAACA TTCCGCGTTG 
AAMCAAACGT TGAAACTGGT GAAMCAGTTA TCTCTGGTAT GGGTGAGCTT 

20 CACCTTGACG TCCTTGTTGA CCGTATGCGT CGTGAGTTCA AAGTTGAAGC 
AAACGTAGGT GCTCCTCAAG TATCTTACCG TGAAACATTC CGCGCTTCTA 
CTCAAGCACG TGGATTCTTC AAACGTCAGT CTGGTGGTAA AGGTCAATTC 
GGTGATGTAT GGATTGAATT TACTCCAAAC GAAGAAGGTA AAGGATTCGA 
ATTCGAAAAC GCAATCGTCG GTGGTGTGGT TCCTCGTGAA TTTATCCCAG 

25 CGGTTGAAAA AGGTTTGGTA GAATCTATGG CTAACGGTGT ACTTGCAGGT 
TACCCAATGG TTGACGTTAA AGCTAAGCTT TACGATGGTT CATACCACGA 
TGTCGACTCA TCTGAAACTG CCTTCAAGAT TGCGGCTTCA CTTGCTCTTA 
AAGAAGCTGC TAAATCAGCA CAACCAGCTA TCCTTGAGCC AATGATGCTT 
GTAACAATCA CTGTTCCAGA AGAAAACCTT GGTGATGTTA TGGGTCACGT 

30 AACTGCTCGT CGTGGACGTG TAGATGGTAT GGAAGCACWC GGTAACAGCC 
AAATCGTTCG TGCTTACGT 



35 2) INFORMATION FOR SEQ ID NO: 1581 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1669 



40 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STEIANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



45 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1581 



AATTGGGGAC TACACCTATT ATGATG 



50 



2) INFORMATION FOR SEQ ID NO: 1582 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1582 
10 GGCAAATCAG TCAGTTCAGG AGT 23 



2) INFORMATION FOR SEQ ID NO: 1583 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1583 

25 

CGATTGGCAA CAATACACTC CTG 23 



30 2) INFORMATION FOR SEQ ID NO: 1584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1584 

TCACCTATTT TTACGCCTGG TAGGAC 26 



45 



2) INFORMATION FOR SEQ ID NO: 1585 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 645 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (C) ACCESSION NUMBER: AF139725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1585 

ATGACTATAC CTGACGCAAA TGCAATCTAT CCTAACTCAG CCATCAAAGA 50 

10 GGTTGTCTTT ATCAAGAACG TGATCAAAAG TCCCAATATT GAAATTGGGG 100 

ACTACACCTA TTATGATGAC CCAGTAAATC CCACCGATTT TGAGAAACAC 150 

GTTACCCATC ACTATGAATT TCTAGGCGAC AAATTAATCA TCGGTAAATT 200 

TTGTTCTATC GCCAGTGGCA TTGAATTTAT CATGAACGGT GCCAACCACG 250 

TAATGAAAGG TATTTCGACT TATCCATTTA ATATTTTAGG TGGCGATTGG 300 

15 CAACAATACA CTCCTGAACT GACTGATTTG CCGTTGAAAG GTGATACTGT 350 

AGTCGGAAAT GACGTGTGGT TTGGGCAAAA TGTGACCGTC CTACCAGGCG 4 00 

TAAAAATAGG TGACGGTGCC ATTATCGGAG CAAATAGTGT TGTAACAAAA 450 

GACGTCGCTC CATATACAAT TGTCGGTGGC AATCCAATTC AACTCATCGG 500 

ACC7y\GATTT GAACCGGAAG TTATTCAAGC ATTAGAAAAT CTGGCATGGT 550 

20 GGAATAAAGA TATTGAATGG ATAACTGCTA ATGTTCCTAA ACTAATGCAA 600 

ACAACACCCA CACTTGAATT GATAAACAGT TTAATGGAAA AATAA 645 



25 2) INFORMATION FOR SEQ ID NO: 1586 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1586 

ATTCCCACAA TCTTTTTTAT CAATAA 26 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1587 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1587 

CATTGTTCAG ATTCGGTAAA GTTC 24 
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2) INFORMATION FOR SEQ ID NO: 1588 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1588 
GTTTTTGAAG TTAAATAGTG TTCTT 



2) INFORMATION FOR SEQ ID NO: 1589 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1589 
CTTCCATTTG TACTTTCCCT A 



2) INFORMATION FOR SEQ ID NO: 1590 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cloning vector pFW16 
(C) ACCESSION NUMBER: U50983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1590 

ATGAAAATTA TTAATATTGG AGTTTTAGCT CATGTTGATG CAGGAAAAAC 50 

TACCTTAACA GAAAGCTTAT TATATAACAG TGGAGCGATT ACAGAATTAG 100 

GAAGCGTGGA CAAAGGTACA ACGAGGACGG ATAATACGCT TTTAGAACGT 150 

CAGAGAGGAA TTACAATTCA GACAGGAATA ACCTCTTTTC AGTGGGAAAA 200 

TACGAAGGTG AACATCATAG ACACGCCAGG ACATATGGAT TTCTTAGCAG 250 
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AAGTATATCG TTCATTATCA GTTTTAGATG GGGCAATTCT ACTGATTTCT 300 

GCAAAAGATG GCGTACAAGC ACAAACTCGT ATATTATTTC ATGCACTTAG 350 

GAAAATGGGG ATTCCCACAA TCTTTTTTAT CAATAAGATT GACCAAAATG 4 00 

GAATTGATTT ATCAACGGTT TATCAGGATA TTAAAGAGAA ACTTTCTGCC 4 50 

5 GAAATTGTAA TCAAACAGAA GGTAGAACTG TATCCTAATG TGTGTGTGAC 500 

GAACTTTACC GAATCTGAAC AATGGGATAC GGTAATAGAG GGAAACGATG 550 

ACCTTTTAGA GAAATATATG TCCGGTAAAT CATTAGAAGC ATTGGAACTC 600 

GT^CAAGAGG AAAGCATAAG ATTTCAGAAT TGTTCTCTGT TCCCTCTTTA 650 

TCATGGAAGT GCAAAAAGTA ATATAGGGAT TGATAACCTT ATAGAAGTTA 700 

10 TTACTAATAA ATTTTATTCA TCAACACATC GAGGTCCGTC TGAACTTTGC 7 50 

GGAAATGTTT TCAAAATTGA ATATACAAAA AAAAGACAAC GTCTTGCATA 800 

TATACGCCTT TATAGTGGAG TACTACATTT ACGAGATTCG GTTAGAGTAT 850 

CAGAAAAGGA AAAAATAAAA GTTACAGAAA TGTATACTTC AATAAATGGT 900 

GAATTATGTA AGATTGATAG AGCTTATTCT GGAGAAATTG TTATTTTGCA 950 

15 AAATGAGTTT TTGAAGTTAA ATAGTGTTCT TGGAGATACA AAACTATTGC 1000 

CACAGAGAAA AAAGATTGAA AATCCGCACC CTCTACTACA AACAACTGTT 1050 

GAACCGAGTA AACCTGAACA GAGAGAAATG TTGCTTGATG CCCTTTTGGA 1100 

AATCTCAGAT AGTGATCCGC TTCTACGATA TTACGTGGAT TCTACGACAC 1150 

ATGAAATTAT ACTTTCTTTC TTAGGGAAAG TACAAATGGA AGTGATTAGT 1200 

20 GCACTGTTGC AAGAAAAGTA TCATGTGGAG ATAGAAATAA CAGAGCCTAC 1250 

AGTCATTTAT ATGGAGAGAC CGTTAAAAAA TGCAGAATAT ACCATTCACA 1300 

TCGAAGTGCC GCCAAATCCT TTCTGGGCTT CCATTGGTCT ATCTGTATCA 1350 

CCGCTTCCGT TGGGAAGTGG AATGCAGTAT GAGAGCTCGG TTTCTCTTGG 1400 

ATACTTAAAT CAATCATTTC AAAATGCAGT TATGGAAGGG ATACGCTATG 14 50 

25- GTTGTGAACA AGGATTGTAT GGTTGGAATG TGACGGACTG TAAAATCTGT 1500 

TTTAAGTATG GCTTATACTA TAGCCCTGTT AGTACCCCAG CAGATTTTCG 1550 

GATGCTTGCT CCTATTGTAT TGGAACAAGT CTTAAAAAAA GCTGGAACAG 1600 

AATTGTTAGA GCCATATCTT AGTTTTAAAA TTTATGCGCC ACAGGAATAT 1650 

CTTTCACGAG CATACAACGA TGCTCCTAAA TATTGTGCGA ACATCGTAGA 17 00 

30 CACTCAATTG AAAAATAATG AGGTCATTCT TAGTGGAGAA ATCCCTGCTC 17 50 

GGTGTATTCA AGAATATCGT AGTGATTTAA CTTTCTTTAC AAATGGACGT 1800 

AGTGTTTGTT TAACAGAGTT AAAAGGGTAC CATGTTACTA CCGGTGAACC 1850 

TGTTTGCCAG CCCCGTCGTC CAAATAGTCG GATAGATAAA GTACGATATA 1900 

TGTTCAATAA AATAACTTAG 1920 

35 



2) INFORMATION FOR SEQ ID NO: 1591 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1591 
50 ATGAGGTAAT AGAACGGATT 20 

2) INFORMATION FOR SEQ ID NO: 1592 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1592 
CAGTATTTCA GTAAGCGTAA A 



2) INFORMATION FOR SEQ ID NO: 1593 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1593 
CCGAGCGATT TACCGGATAC TTGGCTGCIC GCTCGG 



2) INFORMATION FOR SEQ ID NO: 1594 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: N97-330 

(C) ACCESSION NUMBER: AF175293 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1594 

ATGTTTAGAA TTAAAGTTGC AGTTCTGTTT GGGGGCTGTT CAGAGGAACA 5 

TAATGTTTCG ATT^AAATCTG CGATGGAGAT TGCCGCAAAC ATAGATACAA 10 

AAAAGTATCA GCCTTATTAT ATTGGAATCA CAAAATCCGG CGTTTGGAAA 15 

ATGTGTGAAA AACCTTGTTT GGAGTGGGAA CAATATGCGG GGGATCCGGT 20 

TGTTTTTTCA CCGGACAGAA GTACGCATGG TCTGCTGATA CAAAAAGACA 25 

825 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



CTGGGTATGA AATCCAGCCT GTGGATGTGG 
AAGTTTGGCG AGGATGGCTC CATACAAGGC 
TCCGTATGTG GGATGCGATA TTCAAAGCTC 
CGCTTGCATA TACCGTTGTG AAAAATGCGG 
5 CGGATCCTTC AGGAGGGGGA TCGCCTGGAA 
CGTTTTTGTA AAGCCTGCCC GTTCCGGCTC 
TATGCAAGGC AGAAGAACTG CAGGCAGCAA 
GACAGCAAGA TTTTGATTGA AGAGGCCGTT 
CGCCATACTG GGAAACGGAA ATGATCTCAT 

10 TTGAGCTGAG ACACGGCTTT TTTAAGATTC 
AAAGGATCTG AAAATGCAGT CATCCGAGTT 
GGTAATAGAA CGGATTCAGA AAACAGCAAT 
GCTGCAGAGG ATTGGCCCGC ATTGACCTGT 
ATTGTGTTGA ATGAAGTGAA TACCATGCCG 

15 TTATCCCCGC ATGATGACAG CAGCCGGTTT 
ATCGCTTGAT TGAACTTTCA CTTAGGAGGT 



GATTACCGAT GATTCATGGC 300 

TTGCTTGAAT TGTCAGGCAT 350 

CGTGACCTGC ATGGATAAGG 400 

GTATCGCTGT GCCTGGGTTC 450 

ACAGAGGATT TCGTATATCC 500 

ATCCTTTGGC GTAAACAAGG 550 

TCGAAGATGC AAGAAAATAT 600 

ACCGGGAGTG AGGTAGGCTG 650 

GGCTGGCGAG GTGGATCAGA 700 

ATCAGGAAGC ACAGCCGGAG 750 

CCAGCCGCCT TACCGGATGA 800 

GAAGATTTAC CGGATACTTG 850 

TTTTGCGGGA GGACGGCTGC 900 

GGTTTTACTT CCTACAGCCG 950 

TACGCTTACT GAAATACTGG 1000 

AA 1032 



20 2) INFORMATION FOR SEQ ID NO: 1595 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1595 

AAATAATGCT CCATCAATTT GCTGA 25 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1596 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1596 
ATAGTCGAAA AAGCCATCCA CAAG 24 

2) INFORMATION FOR SEQ ID NO: 1597 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1597 
GATGAATTTG CGAAAATACA TGGA 



15 2) INFORMATION FOR SEQ ID NO: 1598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1598 

CAGCCAATTT CTACCCCTTT CAC 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1599 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 604 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: BM4405 

(C) ACCESSION NUMBER: AF136925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1599 



CAGGGAGTAT TTGAGTTATT AGATATTCCA TATGTAGGTT GTGGTATCGG 50 

AGCTGCAGCA ATTTCCATGA ATAAAATAAT GCTCCATCAA TTTGCTGAAA 100 

50 TAGTTGGTGT AAT^GCACT CCTAGTATGA TTATAGAAAA GGGACAAGAC 150 

CTACAAAAAG TCGATGAATT TGCGAAAATA CATGGATTTC CTTTATATAT 2 00 

CAAACCGAAT GAGGCAGGCT CATCAAAAGG AATTAGCAAG GTAGAACAAA 250 

AAAGTGATTT ATATAAAGCA ATAGACGAAG CTTCAAAATA TGATAGCCGT 300 
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ATTTTAATTC AAAAGGAAGT GAAAGGGGTA 
AGGGAATGAA CAATTGGTCG TTGGAGAATG 
ATGGCTTTTT CGACTATGAA GAGAAATACA 
TTGTTACCAG CTAAACTATC AATAGACAAA 
5 AGCAAAAAAA CTATACAGAC TATTAGGGTG 
ACTTTTTCTT AACGGATGAC GGAGAAATTT 
CTCC 
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GAAATTGGCT GTGGGATTTT 350 

TGATCAAATT AGTCTTGTGG 4 00 

ATTTAGTAAC AGCAGAAATT 450 

AAAGAAGACA TCCAGATAAA 500 

CAAAGGATTA GCGAGAATCG 550 

TATTAAATGA GATCAACACC 600 

604 



10 



20 



45 



2) INFORMATION FOR SEQ ID NO: 1600 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 805 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN: ATCC 49349 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1600 

ACCTCAAATT AATGAAGCAA TTGTTGTAAA TTTTGAAAGC GAAGGAAAAA 50 

AACATAAACT TGTTTTAGAA GTAGCAGCTC ATTTAGGAGA TAATAGAGTT 100 

AGAACTATTG CTATGGATAT GACAGATGGT TTGGTAAGGG GCTTAAAAGC 150 

30 TGAGGCTTTA GGTGCTCCTA TTAGTGTTCC TGTTGGTGAG AAAGTTTTAG 200 

GAAGT^TTTT CAATGTTACT GGAGATTTGA TCGATGAAGG TGAAGAAATT 250 

CCTTTTGATA AAAAATGGGC AATTCATAGA GATCCGCCAG CTTTTGAAGA 300 

TCAAAGCACA AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTAGATT 350 

TACTTGCTCC TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA 4 00 

35 GGTGTTGGTA AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT 4 50 

TAAGCATAGC GGCTATTCTG TATTTGCAGG TGTGGGTGAG AGAACTCGTG 500 

AAGGAAATGA CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA 550 

GTTGCTCTAT GTTATGGACA AATGAATGAA CCACCAGGAG CAAGAAATCG 600 

TATTGCTTTA ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG 650 

4 0 GTCTTGATGT GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA 700 

GGTTCTGAAA TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA 750 

TCAACCAACC CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT 800 
CAACT 



2) INFORMATION FOR SEQ ID NO: 1601 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



805 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1601 

CTCCTTACCA GATATCAACA ATGCGCTGGT TGTATATAAA AATGATGCCA 50 

ATAAAACAAA AGTTGTTTTA GAAGCAGCGT TAGAATTAGG TGACGGAATC 100 

ATTCGTGCGA TTGCAATGGA ATCGACTGAT GGATTGCAAC GTGGGATGGA 150 

AGTTGTCGAT ATGGGAGAAT CTATTTCTGT ACCAGTTGGA ACAGAAACTT 200 

TAGGACGTGT GTTTAATGTA TTAGGAGATA CGATTGACTT AGAAGCTCCT 250 

TTTCCTAAAG ATGCACCGCG TAGTGGAATC CATAAAAAAG CCCCTAATTT 300 

TGATGAATTA AGTACAAGCA CAGAAATTCT AGAAACAGGG ATCAAAGTCA 350 

TCGATCTATT AGCCCCTTAT TTAAAAGGTG GTAAAGTTGG ATTGTTTGGG 400 

GGAGCCGGTG TAGGAAAAAC GGTATTGATT CAAGAGTTGA TTCACAATAT 450 

CGCACAAGAA CACGGTGGGA TTTCGGTCTT TACTGGTGTT GGTGAGCGTA 500 

CACGTGAAGG AAACGATTTG TATTATGAAA TGAAAGATTC AGGAGTCATT 550 

GAAAAAACAG CGATGGTGTT TGGACAAATG AATGAGCCAC CTGGTGCACG 60 0 

GATGCGTGTA GCCTTGACTG GATTAACGAT TGCTGAATAT TTCCGTGATG 650 

TTGAAGGACA GGATGTACTG TTGTTTATCG ACAATATCTT CCGTTTTACA 700 

CAAGCTGGTT CTGAAGTGTC TGCCTTGTTA GGTCGTATGC CATCAGCTGT 750 

GGGATATCAA CCAACATTAG CTACCGAAAT GGGGCAATTG CAAGAGCGGA 800 

TCACGTCAAC GAAAAAAGGA TCGATC 826 



2) INFOEIMATION FOR SEQ ID NO: 1602 

(i) SEQUENCE CHAEyVCTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 4 9428 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1602 

TGATACTTTA CCAQATATTA ATAATGCATT AGTAGTATAT AAAAAGGACG 5 

AGQACAAGAC ACGCGTTGTC TTAGAAGCCA CCTTGGAACT TGGAGATGGC 10 

ATGATTCGTG CAATCTCTAT GGGATCGACT GATGGCTTGC AACGGGGAAT 15 

GGAAGTTGTG GACACACAAG CACCTATTTC TGTTCCAGTA GGAAATGAAA 20 

CCTTAGGACG TGTTTTTAAT GTCTTAGGAG AAACGATTGA TAAACAGGCA 25 

CCGTTTCCTG AAGATGCCAA AAAAAGTGGT ATTCATAAAA AAGCACCCGC 30 

TTTTGATGAA TTAAGTACCA GTTCTGAAAT ATTGGAAACC GGGATTAAAG 35 

TAATCGATTT GCTAGCTCCT TATTTACGAG GTGGTAAAGT TGGATTATTT 40 

829 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

GGCGGTGCTG GCGTGGGTAA 
CGTTGCCCAA GAACATGGGG 
GTACTCGTGA AGGAAATGAC 
ATTGAAAAAA CGGCTATGGT 
5 ACGTATGCGT GTAGCGTTAA 
ATGTACAAGG TCAAGACGTA 
ACTCAAGCAG GAACAGAAGT 
CGTTGGTTAC CAACCAACTC 
GAATCACATC GACAGATAAA 



AACGGTATTA ATTCAAGAAT 
GAATTTCTGT TTTTACGGGT 
CTATATTATG AAATGCAGGA 
ATTTGGACAA ATGAACGAAC 
CTGGTTTGAC ACTTGCTGAG 
TTATTATTTA TAGATAATAT 
ATCTGCTTTA TTAGGACGGA 
TAGCAACGGA T^TGGGACAG 
GGATCAATTA CCT 
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TAATTCATAA 


450 


GTCGGAGAGC 


500 


TTCAGGCGTT 


550 


CCCCTGGTGC 


600 


TACTTCCGTG 


650 


TTTCCGCTTT 


700 


TGCCGTCTGC 


750 


TTGCAAGAAC 


800 




833 



2) INFORMATION FOR SEQ ID NO: 1603 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter sputorum subsp. sputorum 
25 (B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1603 

ATCAGCCTAA AATTAACGT^ GCAATAG7AG TTAATTATGA ATTAGATGGT 50 

30 AAAAAAATAA GACTTATTCT TGAAGTAGCT GGACATCTTG GCGATAATAG 100 

AGCAAGAACC ATTGCTATGG ATATGAGTGA TGGTTTACAA AGAGGATTAG 150 

AAGTTACGGC TCTTGGTGCT CCTATAACAG TTCCTGTTGG AGATAAAGTT 200 

TTAGGTAGAA TGTTTAATGT TGTAGGTGAC TTAATAGATG AAGGTGAAGT 250 

AACAGATTTT GATAAAAGAT GGGCTATCCA TAGAGATCCT CCTTCGTTTG 300 

35 AAGATCAAAG TACAAAAAGT GAAATTTTTG AAACAGGTAT AAAAGTAGTT 3 50 

GATCTTCTTG CACCTTATGC AAAAGGTGGT AAAGTTGGCT TATTTGGTGG 400 

TGCTGGCGTT GGAAT^AACAG TTATCATAAT GGAGCTTATA CATAATGTTG 450 

CATTTAAACA CAGCGGTTAT TCAATTTTTG CCGGTGTTGG AGAGAGAACA 500 

AGAGAGGGAA ATGATCTTTA TAATGAGATG AAAGAGTCTG GTGTTTTGGA 550 

40 TAAAGTTGCC TTATGTTATG GACAAATGAA TGAACCACCA GGAGCAAGAA 600 

ACCGTATAGC ATTAACAGGT CTTACAATGG CTGAATATTT CCGTGATGAA 650 

ATGGGGCTTG ATGTGTTGAT GTTTATAGAT AATATTTTTA GATTTTCTCA 700 

ATCAGGTTCT GAAATGTCAG CGCTGCTTGG TAGAATTCCA TCTGCTGTTG 750 

GTTATCAACC AACATTAGCA AGTGAGATGG GAAAACTTCA AGAAAGAATT 800 

45 ACTTCC ^06 



2) INFORMATION FOR SEQ ID NO: 1604 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: EntGrococcus pseudoavium 

(B) STRAIN: ATCC 4 9372 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1604 

GGTGTTATCC GCACAATCGC TATGGAATCT ACAGATGGAT TGCAACGGGG 50 

GATGGAAGTT GTCGATACCG GCAAACCAAT CTCTGTTCCT GTAGGTAAAG 100 

AAACATTAGG TCGTGTGTTT AACGTATTAG GTGAAACGAT CGACAAAGAA 150 

15 GCACCTTTTC CAGAAGATGT AGAAAAGAGC GGTATTCACA AAAAGGCCCC 200 

CGCTTTTGAA GACCTTAGCA CCAGTAATGA GATTTTAGAA ACTGGGATCA 2 50 

AGGTTATCGA CTTATTAGCC CCTTACTTAA AAGGTGGTAA AGTTGGACTA 3 00 

TTCGGTGGTG CCGGTGTTGG TAAAACCGTC TTAATTCAAG AACTGATTCA 350 

TAATATCGCC CAAGAACACG GTGGGATTTC TGTCTTTACC GGGGTTGGGG 400 

20 AACGGACTCG TGAAGGGAAC GACCTTTATT ATGAAATGAA AGAATCCGGC 450 

GTTATTGAAA AAACAGCGAT GGTCTTCGGA CAAATGAATG AGCCACCAGG 500 

TGCGCGGATG CGCGTTGCCT TGACTGGTTT GACATTAGCT GAATATTTCC 550 

GTGATGAAGA AGGTCAAGAT GTGTTGCTAT TTATCGATAA CATTTTCCGC 600 

TTCACACAAG CCGGATCAGA AGTTTCGGCG CTATTAGGTC GGATGCCATC 650 

25 TGCCGTTGGT TATCAACCAA CCTTGGCAAC AGAAATGGGT CAATTACAAG 700 

AACGAATCAC TTCAACGAAA AAAGGCTCAA TTACATCG 738 



30 2) INFORMATION FOR SEQ ID NO: 1605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1605 

ATYATYGAAR TITAYGCICC 20 



45 

2) INFORMATION FOR SEQ ID NO: 1606 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1606 
5 CCRAACATIA YICCIACTTT TTC 23 



10 



2) INFORMATION FOR SEQ ID NO: 1607 



(i) SEQUENCE CHARACTERISTICS: 
•(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1607 

25 CTGGATTATG CGATGTCGGT CATTGTTGGC CGTGCGCTGC CGGATGTCCG 50 

AGATGGCCTG AAACCGGTAC ACCGTCGCGT ACTTTACGCC ATGAACGTAT 100 

TGGGCAATGA CTGGAACAAA GCCTATAAAA AATCCGCCCG TGTCGTTGGT 150 

GACGTAATCG GTAAATACCA CCCTCATGGT GATACCGCCG TTTATGACAC 200 

CATTGTACGT ATGGCACAGC CATTCTCCTT GCGTTATATG CTGGTCGATG 250 

30 GCCAGGGTAA CTTCGGTTCT GTCGATGGCG ACTCCGCCGC AGCGATGCGT 300 

TATACGGAAA TCCGTATGTC GAAAATCGCC CACGAG 336 



35 2) INFORMATION FOR SEQ ID NO: 1608 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1608 

CTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG CTGCCGGATG 50 
TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTATA CGCCATGAAC 100 
GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCTG CCCGTGTCGT 150 
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GGGTGACGTC ATCGGTAAAT ACCACCCTCA 
ACACCATTGT ACGTATGGCG CAGCCATTCT 
GATGGCCAGG GTAACTTTGG TTCGGTCGAC 
GCGTTATACG GAAATCCGTA TGTCGAAGAT 



TGGTGATACT GCCGTATACG 200 

CCCTGCGTTA CATGCTGGTA 250 

GGCGACTCCG CCGCAGCGAT 300 

CGCCCATGAA C 341 



2) INFORMATION FOR SEQ ID NO: 1609 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1609 
GCCCTGATCC AAATAGCATA TA 



2) INFORMATION FOR SEQ ID NO: 1610 

(i) SEQUENCE CHAEIACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1610 
CCTGGCATAA CAGTAACATT CTG 



2) INFORMATION FOR SEQ ID NO: 1611 

(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1611 
TGGGAAAAAG CAACTCCATC TC 
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/ 



10 



15 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1612 
ACAACTGAAT TCGCAGCAAC AAT 23 

2) INFORMATION FOR SEQ ID NO: 1613 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: U19459 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1613 



ATGAAATATG GCCCTGATCC AAATAGCATA TATCCACATG AAGAAATAAA 50 

AAGTGTTTGT TTTATTAAAA ATACAATTAC CAATCCAAAT ATTATAGTTG 100 

35 GAGATTATAC TTACTATTCC GATGTTAACG GAGCTGAAAA ATTTGAAGAA 150 

CATGTGACAC ATCATTATGA ATTTAGGGGT GATAAACTTG TAATTGGCAA 200 

GTTTTGTGCA ATAGCTGAAG GTATAGAATT TATTATGAAT GGAGCAAACC 250 

ATAGAATGAA TTCAATAACA ACTTATCCTT TTAATATAAT GGGAAATGGT 300 

TGGGAAAAAG CAACTCCATC TCTTGAAGAT TTACCATTTA AGGGAGATAC 350 

4 0 TGTTGTTGGA AATGATGTGT GGATTGGTCA GAATGTTACT GTTATGCCAG 400 

GAATTCAZIAT AGGAGATGGA GCAATTGTTG CTGCGAATTC AGTTGTTACA 450 

AAAGATGTAC CACCATATCG TATTATTGGT GGAAATCCGA GTAGAATTAT 500 

AAAGAAAAGG TTTGAAGATG AATTGATAGA TTATTTATTG CAAATAAAAT 550 

GGTGGGATTG GTCAGCACAA AAAATATTTT CTAATCTTGA AACACTTTGT 600 

45 AGCTCTGATT TAGAGAAAAT AAAATCTATT CGAGATTAG 639 



2) INFORMATION FOR SEQ ID NO: 1614 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1614 
CCAATCCAGA AGAAATATAC CC 



2) INFORMATION FOR SEQ ID NO: 1615 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1615 
ATTAGTTTAT CCCCAATCAA TTCA 



2) INFORMATION FOR SEQ ID NO: 1616 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1616 
ATAATGAATG GGGCTAATCA TCGTAT 



2) INFORMATION FOR SEQ ID NO: 1617 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1617 
GCCAACAACT GAATAAGGAT CAAC 24 

5 

2) INFORMATION FOR SEQ ID NO: 1618 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
(C) ACCESSION NUMBER: AF015628 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1618 

ATGAAATGGC AAAATCAGCA AGGCCCCAAT CCAGAAGAAA TATACCCTAT 50 

AGAAGGTAAT AAACATGTTC AATTTATTAA ACCATCTATA ACAAAGCCCA 100 

25 ATATTTTAGT TGGGGAATAT TCATATTACG ATAGTAAAGA TGGTGAATCT 150 

TTTGAAAGCC AAGTTCTTTA TCACTATGAA TTGATTGGGG ATAAACTAAT 200 

ATTAGGGAAG TTTTGTTCTA TTGGACCCGG AACGACATTT ATAATGAATG 250 

GGGCTAATCA TCGTATGGAT GGTTCAACAT TTCCATTCAA TCTTTTCGGA 300 

AATGGTTGGG AGAAGCATAC CCCTACATTG GAAGACCTTC CTTATAAGGG 350 

30 TAACACGGAA ATTGGGAACG ATGTTTGGAT TGGACGAGAT GTGACAATTA 400 

TGCCCGGTGT AAAAATAGGA AACGGGGCTA TTATTGCAGC AAAATCGGTT 450 

GTGACAAAGA ACGTTGATCC TTATTCAGTT GTTGGCGGTA ATCCTTCACG 500 

ATTAATTAAG ATAAGGTTTT CCAAGGAAAA AATCGCAGCA TTACTAAAAG 550 

TAAGGTGGTG GGACCTAGAG ATAGAGACGA TAAATGAAAA TATTGATTGC 600 

35 ATCCTGAATG GTGATATAAA AAAGGTTAAA AGAAGTTAG 639 



40 



2) INFORMATION FOR SEQ ID NO: 1619 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1619 

50 

AAGGCAAAAT AAAAGGAGCA AAGC 24 
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2) INFORI^TION FOR SEQ ID NO: 1620 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1620 
TGTACCCGAG ACATCTTCAC CAC 



2) INFORMATION FOR SEQ ID NO: 1621 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1621 
AATTGAAGGA CGGGTATTGT GGAAAG 



2) INFORMATION FOR SEQ ID NO: 1622 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1622 
CGATTTTGAC AGATGGCGAT AATGAA 



2) INFORMATION FOR SEQ ID NO: 1623 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 bases 

(B) TYPE: Nucleic acid 
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10 



15 



20 



25 



30 



35 



40 



. 45 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M90056 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1623 

ATGAAAATAA TGTTAGAGGG ACTTAATATA AAACATTATG TTCAAGATCG 50 
TTTATTGTTG AACATAAATC GCCTAAAGAT TTATCAGAAT GATCGTATTG 100 
GTTTAATTGG TAAAAATGGA AGTGGAAAAA CAACGTTACT TCACATATTA 150 
TATAAAAAAA TTGTGCCTGA AGAAGGTATT GTAAAACAAT TTTCACATTG 200 
TGAACTTATT CCTCAATTGA AGCTCATAGA ATCAACTAAA AGTGGTGGTG 250 
AAGTAACACG AAACTATATT CGGCAAGCGC TTGATAAAAA TCCAGAACTG 300 
CTATTAGCAG ATGAACCAAC AACTAACTTA GATAATAACT ATATAGAAAA 350 
ATTAGAACAG GATTTAAT^lAA ATTGGCATGG AGCATTTATT ATAGTTTCAC 4 00 

ATGATCGCGC TTTTTTAGAT AACTTGTGTA CTACTATATG GGAAATTGAC 4 50 

GAGGGAAGAA TAACTGAATA TAAGGGGAAT TATAGTAACT ATGTTGAACA 500 
AAAAGAATTA GAAAGACATC GAGAAGAATT AGAATATGAA AAATATGAAA 550 
AAGAAAAGAA ACGATTGGAA AAAGCTATAA ATATAAAAGA ACAGAAAGCT 600 
CAACGAGCAA CTAAAAAACC GAAAAACTTA AGTTTATCTG AAGGCAAAAT 650 
AAAAGGAGCA AAGCCATACT TTGCAGGTAA GCAAAAGAAG TTACGAAAAA 700 
CTGTAAAATC TCTAGAAACC AGACTAGAAA AACTTGAAAG CGTCGAAAAG 7 50 

AGAAACGAAC TTCCTCCACT TAAAATGGAT TTAGTGAACT TAGAAAGTGT 800 
AAAAAATAGA ACTATAATAC GTGGTGAAGA TGTCTCGGGT ACAATTGAAG 850 
GACGGGTATT GTGGAAAGCA AAAAGTTTTA GTATTCGCGG AGGAGACAAG 900 
ATGGCAATTA TCGGATCTAA TGGTACAGGA AAGACAACGT TTATTAAAAA 950 

AATTGTGCAT GGGAATCCTG GTATTTCATT ATCGCCATCT GTCAAAATCG 1000 

GTTATTTTAG CCAAAAAATA GATACATTAG AATTAGATAA GAGCATTTTA 1050 

GAAAATGTTC AATCTTCTTC ACAACAAAAT GAAACTCTTA TTCGAACTAT 1100 

TCTAGCTAGA ATGCATTTTT TTAGAGATGA TGTTTATAAA CCAATAAGTG 1150 

TCTTAAGTGG TGGAGAGCGA GTTAAAGTAG CACTAACTAA AGTATTCTTA 1200 

AGTGAAGTTA ATACGTTGGT ACTAGATGAA CCAACAAACT TTCTTGATAT 1250 

GGAAGCTATA GAGGCGTTTG AATCTTTGTT AAAGGAATAT AATGGCAGTA 1300 

TAATCTTTGT ATCTCACGAT CGTAAATTTA TCGAAAAAGT AGCCACTCGA 1350 

ATAATGACAA TTGATAATAA AGAAATAAAA ATATTTGATG GCACATATGA 14 00 

ACAATTTAAA CAAGCTGAAA AGCCAACAAG GAATATTAAA GAAGATAAAA 14 50 

AACTTTTACT TGAGACAAAA ATTACAGAAG TACTCAGTCG ATTGAGTATT 1500 

GAACCTTCGG AAGAATTAGA ACAAGAGTTT CAAAACTTAA TAAATGAAAA 1550 

AAGAAATTTG GATAAATAA 1569 



2) INFORMATION FOR SEQ ID NO: 1624 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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2) INFORMATION FOR SEQ ID NO: 1628 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15- (A) ORGANISM: Staphylococcus aureus 

(C) ACCESSION NUMBER: U82085 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1628 



2 0 ATGCTTAAAA 
i\ATTTTAAAT 
TTGTAGGTAA 
GGACTAATAG 
TATTAAATAT 

25 GAAAATATGC 
AGTGGTGGTG 
ATGTTCTTTA 
GAATAGAGTT 
GTAGTATCTC 

30 TGAAATTGAA 
ATATAGAGCA 
AAGTATAATT 
GAATAAGGCG 
CTGAATCTAG 

35 CATAGAAATA 
AGAAAAAATA 
AGCT^TTTCA 
TTTAATAATC 
AGAAAAGGTT 

4 0 TGAAGCTGAT 
AAAATTGGAT 
TATCTTAGAA 
GAATTGTTCT 
ATAAATGTTT 

45 ATTTGTTAGC 
TGGATATCGA 
GGTGTTGTGT 
TGAACAATTG 
CATATATAGA 

50 GAAAAAGAAC 
ATTATTAAGT 
TAGATGAAAA 
AATATTTA 



TCGACATGAA 
ATAAAAGAAC 
GAATGGAGTT 
AGATTGACGA 
ATCTCTCAAT 
TTCAATATTT 
AAAAAACTAG 
ATGCTCGTAG 
GATAACAAAT 
ATGATAGAAT 
AATGGATATA 
AAAAGAAATG 
CTAAAAGAAA 
CAAGGAATGA 
AATATGGAAA 
CGAAATCGTT 
AAAGAGCTTC 
TGGTCGCAAT 
AATTTCTTTG 
GCTATAATTG 
TCTAGAAAAA 
ACGTCAGTCA 
AATGTTATGT 
AGCAAGATTA 
TGAGTGGTGG 
GATTGTAATA 
TGCTGTTGAG 
TATTTGCTTC 
TTAATT^TAG 
ATATTTAAAA 
TCAAAGAAAA 
AAAATCTCAA 
GTACAAATTG 



GAATGTAAAA 
TAAAGATTTA 
GGCAAAACAA 
AGGAAATATA 
TAGAAGAACC 
CAAGTTGAAA 
ATTTAAACTA 
ATGAACCTAC 
ACTTTTAAAG 
TTTTTTAGAT 
TTAGAGAATT 
CTTCTACGAA 
GCAATTGGAG 
TTAAGCCCCC 
ATGCAACATG 
GGAAACACGA 
CTTCTATTAA 
GTAATTAGTT 
GAGAGATGCT 
GTAACAATGG 
GTAGAATCAG 
AAACTTAGAT 
CTACCTCCAT 
CATTTTTATC 
AGAACAAATA 
CATTAATTCT 
GCATTAGAAG 
CCATGATT^AA 
AAAATAATAA 
ATTAAAGATA 
AAAGATGATA 
TGGAAGAAAA 
AAATTAAAAG 



AAATATTATG 


CAGATAAATT 


50 


TAGTGGGGAT 


AAAATAGGTA 


100 


CACTTTTAAA 


AATAATAAAA 


150 


ATTATAAGTG 


AAAAAACAAC 


200 


ACATAGTAAG 


ATAATTGATG 


250 


ATAAGTGGAA 


TGACAATATG 


300 


GCAGAGGGAT 


TTCAAGATCA 


350 


;\AGTAATTTA 


GATATCGAAG 


400 


AGTACCGTGA 


TACTTTTTTG 


450 


CAAGTTTGTA 


CAAAAATTTT 


500 


CATCGGTAAT 


TATACAAACT 


550 


AGCAACAAGA 


AGAATACGAA 


600 


CAAGCTATAA 


AGCTAAAAGA 


650 


TTCAAAAACA 


ATGGGAACAT 


700 


CTACTAAACA 


AAAAAAGATG 


750 


ATAGATAAAT 


TAAATCATGT 


800 


AATGGATTTA 


CCTAATAGAG 


850 


TAAAAAACTT 


ATCTATAAAA 


900 


TCATTTGTCA 


TTAAAGGTGG 


950 


TGTAGGAAAA 


ACAACATTGT 


1000 


TAATAATATC 


ACCATCAGTT 


1050 


GTTCTACAAT 


CTCATAAATC 


1100 


TCAAGATGAA 


ACAATAGCAA 


1150 


GCAATGATGT 


TCATAAAGAA 


1200 


AAGGTTGCTT 


TTGCCAAGCT 


1250 


TGATGAACCA 


ACAAACTATT 


1300 


AATTGTTAAT 


TACCTATGAA 


1350 


AAATTTATAC 


AAAACCTAGC 


1400 


AGTGAAAAAA 


TTCGAAGGAA 


1450 


AACCAAAATT 


AAATACAAAT 


1500 


CTAGAAATGC 


AAATTTCATC 


1550 


TGAAGAAAAA 


AACAAAGAAT 


1600 


AATTGAAAAG 


CCTAAATAAA 


1650 






1658 
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2) INFORMATION FOR SEQ ID NO: 1629 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1629 
AAGGGGAAAG TTTGGATTAC ACAACA 



2) INFORMATION FOR SEQ ID NO: 1630 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1630 
GAACCACAGG GCATTATCAG AACC 



2) INFORMATION FOR SEQ ID NO: 1631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1631 
CGACGATGCT TTATGGTTTG T 



2) INFOEIMATION FOR SEQ ID NO: 1632 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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20 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1632 
GTT/y^TTTGC CTATCTTGTC ACACTC 26 

2) INFORMATION FOR SEQ ID NO: 1633 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 900 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M36022 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1633 

ATGGAATTTA AATTACAAGA ATTAAATCTT ACTAACCAAG ATACAGGACC 50 

ATATGGTATA ACCGTTTCAG ATAAGGGGAA AGTTTGGATT ACACAACATA 100 

AAGCAAATAT GATAAGTTGC ATCAATTTAG ATGGAAAAAT TACAGAGTAC 150 

30 CCACTACCGA CACCAGATGC AAAAGTCATG TGTTTAACTA TATCCTCAGA 200 

TGGGGAAGTT TGGTTTACTG AGAATGCAGC AAACAAAATA GGGAGGATTA 250 

CAAAAAAAGG GATTATTAAG GAATATACAT TGCCTAACCC AGATTCAGCA 300 

CCCTACGGTA TTACAGAAGG ACCAAATGGA GATATATGGT TTACAGAAAT 350 

GAATGGCAAC CGTATTGGAC GTATTACGGA CGACGGTAAA ATTCGTGAAT 400 

35 ACGAGCTGCC TAATAAAGGA TCTTACCCTT CTTTTATCAC TTTGGGTTCT 4 50 

GATAATGCCC TGTGGTTCAC AGAAAATCAA AATAATGCTA TTGGTAGAAT 500 

TACAGAAAGT GGGGATATTA CAGAGTTTAA AATTCCTACA CCTGCATCAG 550 

GACCAGTTGG TATTACAAAG GGGAACGACG ATGCTTTATG GTTTGTGGAA 600 

ATTATCGGTA ATAAGATAGG GCGAATAACT CCTCTGGGGG AAATTACCGA 650 

40 ATTCAAAATT CCAACGCCAA ACGCTCGACC TCATGCAATT ACTGCTGGAG 700 

CAGGAATTGA TTTATGGTTT ACTGAATGGG GGGCTAATAA AATAGG/U^GG 750 

CTGACAAGCA ATAATATAAT TGAGGAATAC CCAATTCAAA TCAAAAGTGG 800 

TGAACCACAT GGCATTTGTT TCGATGGTGA AACAATTTGG TTTGCAATGG 850 

AGTGTGACAA GATAGGCAAA TTAACTCTCA TTAAGGATAA TATGGAGTGA 900 

45 



2) INFORMATION FOR SEQ ID NO: 1634 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1634 
TTAACTTGTC TATTCCCGAT TCAGG 



2) INFORMATION FOR SEQ ID NO: 1635 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1635 
GCTGTGGCAA TGGATATTCT GTA 



2) INFORMATION FOR SEQ ID NO: 1636 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1636 
TTCCTACCCC TGATGCTAAA GTGA 



2) INFORMATION FOR SEQ ID NO: 1637 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1637 
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5 



15 



35 



45 



CAAAGTGCGT TATCCGAACC TAA 23 



2) INFOEy>4ATION FOR SEQ ID NO: 1638 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 527 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1638 

GGTGGGGAAG ACTGTCTTCA TCCAGGAATT GATTGTGAGT CGTTCCACAT 50 

GCTCACCTAG TTTTCGCTCG ATCTTTTCAC TAACGCAAAC CATGTAGAAC 100 

AACATTGCCA AGGCCCACGG TGGTTACTCC GTCTTCACTG GTGTTGGTGA 150 

25 GCGTACTCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG GAGACTGGTG 200 

TCATTCAGCT CGAGGGTGAA TCCAAGGTCG CACTGGTGTT CGGACAGATG 250 

AACGAGCCCC CCGGTGCCCG TGCCCGTGTC GCCCTTACCG GTCTGACCAT 300 

TGCCGAGTAC TTCCGTGACG AGGAGGGTCA GGACGTGCTG CTCTTCATTG 350 

ACAACATTTT CCGTTTCACC CAGGCCGGTT CTGAGGTGTC TGCCCTTCTC 400 

30 GGTCGTATCC CCTCTGCCGT CGGTTACCAG CCCACCCTGG CCGTCGACAT 4 50 

GGGTGGTATG CAGGAGCGTA TCACCACCAC CAAGAAGGGT TCTATTACCT 500 

CCGTCCARGC CGTCTACGTC CCCGCGA 527 



2) INFORMATION FOR SEQ ID NO: 1639 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 452 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1639 

ACTGTCTTGA TTC/IAGAACT TATTAACAAC ATTGCTAAAG CCCATGGTGG 50 
TTACTCTATT TTCTGTGGTG TTGGTGAACG TACTCGTGAA GGTAACGATT 100 
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TATACCACGA AATGATGGAA ACTGGTGTCA TTAAACTTGA AGGTGACTCC 150 

AAGTGTGCTC TTGTATTCGG TCAAATGAAC GAACCTCCTG GTGCTCGTGC 200 

CCGTGTTGCT TTAACTGGTT TAACCATTGC TGAATACTTC CGTGATGAAG 250 

AAGGTCAAGA TGTGTTACTT TTCATTGATA ACATTTTCCG TTTCACTCAA 300 

GCTGGTTCTG AAGTATCTGC CCTTTTAGGT CGTATTCCAT CTGCTGTAGG 350 

TTACCAACCC ACTTTATCTA CTGATATGGG TGGTATGCAA GAACGTATTA 4 00 

CTACTACCAA GAATGGTTCC ATTACCTCTG TACARGCCGT CTACGTCCCC 450 

GC 452 



2) INFORMATION FOR SEQ ID NO: 1640 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoldes 

(B) STRAIN: ATCC 6462 
25 (C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1640 

ACGCCCTTAC GGTAAAACAA AGCAACGAAA ACGGAAGCAT GAACTTAACA 50 

30 TTTGAAGTTG CACTTCATTT AGGTGATGAT ACAGTTCGTA CAGTTGCGAT 100 

GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA GATACTGGTA 150 

AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG TGTATTCAAC 200 

GTATTAGGTG ATGCAATTGA CTTAGATGGT GAACTTCCTG CGGATGTACA 250 

CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA TTATCTACTA 300 

35 AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT ACTTGCTCCT 350 

TACATTAAGG GTGGTAAGAT CGGCCTATTC GGTGGTGCCG GCGTAGGTAA 400 

AACAGTATTA ATTCAAGAAT TAATTAACAA CATCGCACAA GAGCACGGTG 450 

GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA GGGTAATGAC 500 

TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA CTGCGATGGT 550 

40 ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT GTTGCATTAA 600 

CAGGTTTAAC AATGGCTGAG CATTTCCGTG ATGAGCAAGG ACAAGACGTA 650 

CTTCTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG GTTCTGAAGT 700 

ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC CAACCAACAC 750 

TTGCAACAGA AATGGGTCAA TTACAAGAGC GTA 783 

45 



2) INFORMATION FOR SEQ ID NO: 1641 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus my co ides 

(B) STRAIN: NRRL NRS-319 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1641 



10 



ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG 200 

15 TGTATTCAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA 300 

CTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 4 00 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

20 GAACACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

GGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT 600 

GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 
ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCAG ' 700 

25 GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 7 50 

CAACCAACAC TTGCAACAGA AATGGGTCAA TTACTU^GAGC GTATTACATC 800 

TACAAATAAA GGATCTATCA CGT 823 



30 

2) INFORMATION FOR SEQ ID NO: 1642 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus my co ides 

(B) STRAIN: NRRL BD-15 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1642 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAAAG 50 

CAACGAAAAC GGAGCAAACA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA TACAGTTCGT ACAGTTGCGA TGTCTTCCAC AGATGGACTT 150 

50 GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGTA ACACTTGGTC GTGTATTCAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAAGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 
CAAGCACCTG CATTCGAAGA ATTATCTACT T^AAGTAGAAA TTCTTGAAAC . 350 
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TGGTATTAAA GTAGTAGACT TACTTGCTCC 
TTGGTCTATT CGGTGGTGCC GGTGTAGGTA 
TTAATTAACA ACATCGCACA AGAACACGGT 
TGTAGGTGAG CGTACTCGTG AAGGTAACGA 
5 ATTCTGGCGT AATTAAGAAA ACTGCGATGG 
CCACCTGGAG CACGTCAACG TGTTGCATTA 
GCATTTCCGT GATGAGCAAG GACAAGACGT 
TCTTCCGTTT CACGCAAGCA GGTTCTGAAG 
ATGCCATCTG CGGTAGGTTA CCAGCCAACA 
10 ATTACAAGAG CGTATTACAT CTACAAATA 



PCT/CAOO/01150 

TTACATTAAG GGTGGTAAGA 400 

AAACAGTATT AATTCAGGAA 4 50 

GGTATCTCTG TATTCGCCGG 500 

CTTATACCAC GAAATGAGCG 550 

TATTCGGACA AATGAACGAG 600 

ACAGGTTTAA CAATGGCTGA 650 

ACTACTGTTC ATCGATAACA 700 

TATCTGCCCT TCTTGGTCGT 7 50 

CTTGCAACAG AAATGGGTCA 800 

829 



2) INFORMATION FOR SEQ ID NO: 1643 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1643 

3 0 AATCTACAAT GCCCTTACGG TAAAACAAAG CAACGAAAAC GGAGCAAGCA 50 

TTAACTTAAC ATTTGAAGTT GCACTTCATT TAGGTGATGA CACAGTTCGT 100 

ACAGTTGCGA TGTCTTCCAC AGATGGACTT GTTCGTGGCA CAGAAGTAGA 150 

AGATACTGGT AAACCAATCT CTGTACCGGT TGGTGATGCA ACACTTGGTC 200 

GTGTATTTAA CGTATTAGGT GATGCAATTG ACTTAGATGG TGAAGTTCCT 250 

35 GCGGATGTAC GCCGTGATCC AATTCACCGT CAAGCACCTG CGTTCGAAGA 300 

GTTATCTACG AAAGTAGAAA TTCTTGAAAC TGGTATTAAA GTAGTAGACT 350 

TACTTGCTCC TTACATTAAA GGTGGTAAAA TCGGTCTATT CGGTGGTGCC 400 

GGTGTAGGTA AAACAGTATT AATCCAGGAA TTAATTAACA ACATCGCACA 4 50 

AGAGCACGGT GGTATTTCTG TATTCGCTGG TGTAGGTGAG CGTACTCGTG 500 

40 AAGGTAATGA CTTATACCAC GAAATGAGCG ATTCTGGCGT AATCAAGAAA 550 

ACAGCGATGG TATTCGGACA AATGAACGAG CCACCTGGTG CACGTCAACG 600 

TGTTGCATTA ACAGGATTAA CAATGGCTGA ACATTTCCGT GATGAGCAAG 650 

GACAAGACGT ACTATTGTTC ATCGATAACA TCTTCCGTTT CACGCAAGCG 700 

GGTTCTGAAG TATCTGCCCT TCTTGGTCGT ATGCCATCTG CGGTAGGTTA 750 

45 CCAACCAACT CTTGCAACAG AAATGGGTCA ATTACAAGAG CGTATTACAT 800 

CTACAAATAA AGGATCTATC ACG 823 



50 2) INFORMATION FOR SEQ ID NO: 1644 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 708 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1644 

GGAAAGCTAC CTGAAATCTA CAATGCCCTT AGAATTAAAC AAGACGCAGT 50 

TAACTTAACT TTAGAAGTTG CACTTCACTT AGGTGATGAT ACAGTTCGTA 100 

15 CAGTTGCGAT GTCTTCCACA GACGGACTTG TTCGTGGTAC TGCAGTAGAA 150 

GATACTGGCA AAGCGATTTC TGTTCCAGTT GGTGATGCAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GAGGTTCCAG 250 

CAGATGTACG CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAG 300 

TTATCTACAA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

20 ACTTGCTCCT TACATTAAAG GTGGTAAAAT CGGTCTATTC GGTGGTGCCG 400 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 450 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGTGC ACGTCAACGT 600 

25 GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 700 

GTTCTGAA 708 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1645 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 778 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 5 

GTACGATGCG CTTGAAGTTC AAAACGATGT CAAACTAGTG CTGGAAGTTC 50 

AACAGCAGCT CGGTGGTGGT GTCGTTCGTT GTATCGCTAT GGGTACCTCA 100 

GATGGCCTAA GCCGCGGTCT GAAAGTACTT GATTTAGAAC ACCCTATCGA 150 

50 AGTTCCTGTC GGTGTGGCAA CTCTGGGCCG TATTATGAAC GTGCTTGGTG 200 

AGCCTATCGA TATGAAGGGT GACATTGGTG AAGAAGAGCG TTGGGCTATC 250 

CATCGCGCTG CACCAAGCTA CGAAGATTTA TCCAGTGCTA ACGAACTGCT 300 

GGAAACGGGC ATCAAGGTTA TCGACCTGAT TTGTCCGTTT GCCAAAGGCG 350 

848 
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GTAAAGTTGG TCTGTTTGGT GGTGCCGGCG 

ATGGAGCTGA TTCGTAATAT TGCGACTGAG 

TGCCGGCGTT GGTGAGCGTA CTCGTGAGGG 

TGACAGAATC TAACGTATTA GACAAAGTAT 

5 AATGAGCCAC CGGGAAACCG TCTGCGCGTT 

GGCGGAAAAA TTCCGTGATG AAGGCCGTGA 

ACATTTATCG TTATACCTTA GCCGGTACAG 

CGTATGCCAT CAGCGGTAGG TTACCAACCA 
TACACTGCAA GAACGTATCA CYTCAACC 



PCT/CAOO/01150 

TAGGTAAAAC GGTAAACATG* 400 

CACTCAGGTT ACTCTGTATT 4 50 

TAATGACTTC TACCACGAAA 500 

CTCTGGTTTA TGGCCAAATG 550 

GCGTTAACCG GCCTGACCAT 600 

CGTTCTGCTG TTTATCGATA 650 

AAGTATCAGC ACTGTTAGGT 700 

ACGCTGGCGG AAGAGATGGG 750 

778 



2) INFORMATION FOR SEQ ID NO: 1646 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttlauxella agrestis 
25 (B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1646 

GATGCCGTAC CAAAAGTGTA CGACGCTCTT GAGGTTACAA ATGGTAAAGA 50 

3 0 CCGTCTGGTG CTGGAAGTTC AGCAACAGTT AGGTGGTGGC GTAGTGCGTA 100 

CTATCGCCAT GGGTACTTCT GATGGTTTGC GTCGTGGTCT GGAAGTTTCT 150 

AACCTCGATC ACCCAATTGA AGTGCCAGTA GGTAAAGCAA CTCTGGGCCG 200 

TATCATGAAC GTCCTGGGCG AGCCTATCGA CATGAAAGGC GATATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTGCTG CTCCTAGCTA CGAAGAACTG 300 

35 TCTAGCTCCC AGGATCTGCT GGAAACCGGC ATCAAAGTAA TGGACCTGAT 350 

TTGCCCGTTC GCTAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TGGGTAAAAC TGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCAGTGTT TGCAGGCGTG GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCATGAAA TGACCGATTC CAACGTTCTG GACAAAGTAT 550 

40 CACTGGTTTA TGGCCAGATG T^ACGAGCCAC CAGGTT^CCG TCTGCGCGTT 600 

GCGTTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATTTACCG TTATACCCTG GCCGGTACAG 700 

AAGTATCTGC GCTGCTGGGT CGTATGCCAT CTGCGGTAGG TTACCAGCCA 750 

ACTCTGGCAG AAGAGATGGG TGTTTTGCAG GAGCGTATTA CCTCCACCAA 800 

45 AACTGG 806 



2) INFORMATION FOR SEQ ID NO: 1647 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 bases 

(B) TYPE: Nucleic acid 

849 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



(C) STEU^NDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1647 

GAAACCGTTG TTGACACTGG TTCCCCAATC ACCGTCCCAG TTGGTCGTGA 50 

AACCTTGGGT CGTATCATTA ACGTTATCGG TGAACCAATC GATGAACGTG 100 

GTCCAATCAA CACCAAGCAA AGAAACCCAA TCCATGCTGA TCCTCCTTCA 150 

TTTGCTGAAC AATCCACTTC TGCTGAAGTT TTAGAAACCG GTATTAAAGT 200 

TGTCGATTTA TTGGCTCCTT ATGCCAGAGG TGGTAAAATT GGTTTATTCG 250 

GTGGTGCCGG TGTCGGTAAA ACCGTCTTTA TCCAAGAATT GATTAACAAC 300 

ATTGCTAAAG CTCATGGTGG TTTCTCAGTC TTCACCGGTG TCGGTGAAAG 3 50 

AACCAGAGAA GGTAACGATT TATACCGTGA AATGAAAGAA ACCGGTGTCA 4 00 

TTAACTTGGA AGGTGACTCT AAAGTCGCTT TAGTCTTCGG TCAAATGAAC 450 

GAACCTCCAG GTGCTAGAGC CCGTGTTGCC TTAACCGGTC TAACCATTGC 500 

TGAATACTTC AGAGATGAAG AAGGTCAAGA TGTCTTGTTA TTCGTTGATA 550 

ACATTTTCAG ATTCACCCAA GCTGGTTCAG AAGTCTCTGC CTTATTAGGT 600 

CGTATTCCAT CTGCTGTCGG TTATCAACCA ACCTTGGCCA CTGATATGGG 650 

TTTGTTGCAA GAACGTATCA CCACCACCAG AAAAGGTTCC GTCACTTCTG 700 

TCCAAGCTGT TTATGTCCCA GCTGATGATT TGACTGATCC TGCCCCAGCC ,750 

ACCACTTTCG CGCATTTGGA TGCTACTACT GTCTTGTCTC GTGGTATCTC 800 

AGAATTGGGT ATCTACCCAG CTGTCGATCC ATTGGATTCC AAATCAAGAT 850 

TATTAGATGC CGCTGTTGTT GGTCAAGAAC ATTACGATGT TGCTACTCAA 900 

GTTCAACAAA CTTTACAAGC CTACAAATCT TTACAAGATA TTATCGCTAT 950 

TTTAGGTATG GATGAATTAT CAGAACAAGA TAAATTGACC GTTGAAAGAG 1000 

CTAGAAAGAT CCAACGTTTC TTATCTCAAC CATTTGCTGT CGCTGAAGTT 1050 

TTCACTGGTA TTCCAGGTAG ATTAGTTAGA TTGAAAGAAA CCGTTGCTTC 1100 

ATTCAGAGAC GTTTTAGCTG GT 1122 



2) INFORMATION FOR SEQ ID NO: 1648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700677 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1648 
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TACGTCTCTT 
TGTTTCAAAT 
GTT^CGTTTC 
GGATCAACTA 
TGTCTACGAT 
CTGGGACAGA 
ATTACTCTGC 
AACACTAAAC 
TTGGTATCGA 
ACAACTGAAT 
CGCCTATGCC 
TCAATAAAGT 
GGCACAAGAG 
GAAAACAGTC 
TTCCTCAAGC 
GAAAAGTATA 



ACCCTGACGA 
GGTAAAGTCA 
ATTTGGTACC 
TGAAACCAAT 
TCAACTGCTA 
TATCCCTCTC 
AATATGCCCT 
AAGGTCGGTC 
CTATCCAAGC 
CCAACAAAAA 
GCATTCGCAA 
TGTCTTTAGC 
CCATGAAAGA 
TGGACGTACG 
TGGTAAAACA 
TCAAGAACAC 



TGATTTGCAA 
TCGCCCAACT 
AACCAAGCTG 
CACAGACTAT 
CTATCGTTCA 
TATAACTGGG 
TCAACAATCT 
TAGATAAGGC 
ATGCATTATG 
ATATGGTGCA 
ATGGTGGTAT 
GATGGTAGCG 
AACGACTGCT 
Gi\ACTGGTCG 
GGTACCTCTA 
TGGTTACGTA 



GTCGCATCTA CGGTCGTAGA 
TGGAGCTCGT CACCAAGCAA 
TGGAAACCAA TCGTGACTGG 
GCTCCTGCCT TGGAGTACGG 
CGATGAGCCC TATAACTACC 
ATCGAGCATA TTTCGGTAAT 
CGTAACGTAC CTGCCGTTGA 
TAAAACCTTC CTTAATGGTC 
CAAACGCCAT TTCAAGTAAT 
AGTAGTGAAA AAATAGCTAC 
TTACCACAAA CCAATGTACA 
AAAAAGAATT TTCTGACCCT 
TACATGATGA CAGAAATGAT 
T6GTGCCTAC CTGCCTTGGC 
ACTATACTGA CGAAGAAATT 
GCTCCAGATG AAATGTTTGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
813 



GGGTTATACC CGT 



2) INFORMATION FOR SEQ ID NO: 1649 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1649 

ATTAATGAAG CTATTGTTGT TAATTATGAA TTAGAAGGAA AAGAATGCAA 50 

GCTAGTTCTT GAAGTAGCTG CACATTTAGG CGATAATAAA GTAAGAACCA 100 

TCGCTATGGA TATGACAGAT GGTCTTGTTA GAGGTTTAAC AGCCGTCGCA 150 

ACTGGAAATC CAATTAGTGT TCCAGTAGGC GAAAAAGTTC TTGGAAGAAT 200 

TTTTAATGTA ACGGGTGATT TGATTGATGA GGGCGAAGAA ATCAATTTTG 2 50 

ATAAGCACTG GTCAATTCAT AGAGATCCAC CTCCATTTGA AGAACAAAGT 3 00 

ACAAAAAGCG AAATCTTTGA AACAGGTATA AAGGTTGTTG ATTTGCTAGC 350 

TCCTTATGCT AAAGGTGGAA AAGTTGGTCT TTTTGGTGGT GCAGGTGTTG 4 00 

GTAAAACCGT TATTATTATG GAATTAATTC ACAATGTTGC ATTTAAACAT 450 

AGCGGATATT CTGTTTTTGC AGGTGTTGGC GAAAGAACTC GTGAGGGTAA 500 

TGACCTTTAC AATGAAATGA AAGAAAGTAA TGTATTAGAT AAAGTTGCAT 550 

TGTGTTATGG TCAAATGAAT GAACCACCAG GGGCAAGAAA TCGTATAGCT 600 

TTAACAGGTC TTACTATGGC TGAGTATTTT AGAGATGAAA TGGGACTTGA 650 

TGTTTTAATG ATTATTGATA ATATTTTCAG ATTTTCTCAA TCAGGCTCAG 700 

AAATGTCAGC ACTTTTAGGA AGAATTCCTT CAGCTGTTGG TTATCAACCA 750 

ACCTTAGCTA GTGAAATGGG TAAGTTCCAA GAAAGAATTA CTTCAACCAA 800 

GAAAGGATCT ATT 813 



851 
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2) INFORMATION FOR SEQ ID NO: 1650 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY:- Linear 

(ii) iVIOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1650 

GGTGGGGAAG ACTGTGTTCA TTCAGGAATT GATTGTAAGT TCTGTTATCA 50 

ACTAAAGCCG ACAGCGGTTG CTGATATGCT CTAGAACAAC ATTGCTAAGG lOQ 

CTCACGGTGG TTACTCCGTG TTCACTGGTG TCGGTGAGCG TACCCGTGAG 150 

20 GGTAACGATT TGTACCATGA AATGCAAGAG ACCCGTGTCA TTCAACTCGA 200 

CGGAGAGTCC AAGGTCGCTC TTGTCTTCGG TCAAATGAAC GAGCCCCCTG 250 

GTGCCCGTGC CCGTGTTGCC CTTACCGGTT TGACCATTGC TGAATACTTC 300 

CGTGACGAGG AAGGCCAAGA CGGTAGGCTT CATGCTTCTA TCGCTAGGGG 350 

CGTGTGATAC AGGAGGCTAA TCGCTTTTCT AGTGCTTCTC TTTATTGACA 4 00 

25 ACATTTTCCG TTTCACTCAA GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT 4 50 

CGTATTCCTT CCGCTGTCGG TTACCAACCT ACTCTCGCCG TCGACATGGG 500 

TGTTATGCAG GAACGTATCA CCACCACCAC CAAGGGATCC ATTACTTCAG 550 

TGCARGCCGT CTACGTCCCC 570 



30 



45 



2) INFORMATION FOR SEQ ID NO: 1651 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsla parva 

(B) STRAIN: ATCC 10784 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1651 



TGTCTTCATC CAGGAGTTAA TTGTACGTTT TTTGTCTGCC TACTATAATA 50 

GACGAAGAAA TATTTTGATT TATTTTTCTA ATAATTCAAA TAGAACAACA 100 

50 TTGCCAAAGC CCACGGTGGT TACTCTGTCT TCACTGGTGT TGGCGAGCGG 150 

ACCCGTGAAG GAAACGATCT GTACCACGAA ATGCAGGAGA CCCGTGTCAT 200 

CCAGCTCGAT GGCGAGTCCA AGGTCGCCCT CGTCTTCGGT CAAATGAACG 250 

AACCCCCCGG AGCCCGTGCC CGTGTTGCCC TCACTGGTGT GACCGTTGCT 300 

852 
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GAATACTTCC GTGACGAGGA 

CCAGTAATTT GACTCGAAGC 

CATCGACAAC ATTTTCCGCT 

TGCTCGGCCG TATCCCCTCC 

5 GACATGGGTA TGATGCAGGA 
CACCTCCGTG 



AGGTCAAGAT GGTGCGTATA 
TCCACTCACA CATATATTAG 
TCACCCAGGC AGGTTCCGAA 
GCCGTCGGTT ACCAGCCCAC 
ACGTATCACC ACCACCACCA 



PCT/CAOO/01150 

TATATATTCG 350 
TGCTCCTCTT 4 00 

GTGTCCGCCC 450 
CCTCGCTGTC 500 
AGGGCTCCAT 550 

560 



10 2) INFORMATION FOR SEQ ID NO: 1652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14 97 6 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1652 



CCACAAGTGT ACAGCGCCCT TGAGGTTAT^ AATGGTGATG CTCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TGGGCGGTGG CGTGGTTCGT ACCATCGCCA 100 

TGGGTTCTTC AGACGGCCTT AAGCGTGGTC TGGAAGCCGT TGACCTTCAG 150 

CACCCAATTG AAGTACCGGT AGGTACTGCC ACACTTGGCC GTATCATGAA 200 

30 CGTGCTGGGT GAGCCGATCG ATATGAAAGG CGACATTGGC GAAGAAGAGC 250 

GCTGGGCGAT TCACCGCTCT GCACCTTCTT ACGAAGATCA GTCGAACTCT 300 

CAGGATCTGC TGGAAACCGG CATCAAGGTG ATTGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCTTGTTCGG TGGTGCGGGC GTAGGTAAAA 4 00 

CCGTCAACAT GATGGAGCTT ATTCGTAACA TTGCGGCTGA GCACTCAGGT 4 50 

35 TTCTCGGTAT TTGCCGGTGT GGGTGAGCGT ACCCGTGAAG GTAACGACTT 500 

CTACCACGAA ATGACCGACT CCAACGTTAT CGACAAAGTT TCGCTGGTCT 550 

ATGGTCAGAT GAACGAGCCA CCGGGTAACC GTCTGCGCGT TGCGCTGACC 600 

GGTCTGACCA TGGCGGAGAA GTTCCGTGAC GAAGGTCGTG ACGTACTGCT 650 

GTTTATCGAT AACATCTATC GTTACACCCT GGCCGGTACT GAAGTCTCCG 700 

40 CTCTGCTGGG TCGTATGCCT TCTGCGGTAG GTTACCAGCC AACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAACGTATC 780 



45 2) INFOEy^TION FOR SEQ ID NO: 1653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

853 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1653 

TGTGTTCATT CAGGAGCTGA TTGTGAGTAC CCCGGAGATT TTCCTGCGAT 50 

TGCGCATGAA GCAAGCGCTG ACGTCCATCT AGAACAACAT CGCCAAGGCT 100 

10 CACGGTGGTT ACTCCGTGTT CTGCGGTGTC GGCGAGCGTA CTCGTGAGGG 150 

TAACGATTTG TACCACGAAA TGCAGGAGAC CGGTGTCATC AACCTCGAGG 200 

GCGAGTCCAA GGTCGCCCTT GTCTTCGGTC AGATGAACGA GCCCCCGGGA 250 

GCCCGTGCCC GTGTCGCCCT TACTGGTCTT ACCGTCGCTG AGTAAGTTTT 300 

GACAACCAGA AGCGAGTATT GCCACAATTA CTGACTAAAA ATCAAGATAT 350 

15 TTCCGTGACG AGGAGGGCCA GGATGTGCTT CTCTTCATTG ACAACATTTT 400 

CCGTTTCACC CAGGCCGGTT CTGAGGTGTC CGCTCTTCTC GGCCGTATTC 4 50 

CCTCTGCCGT CGGTTACCAG CCCACTCTCG CCGTCGACAT GGGTATGATG 500 

CAGGAGCGTA TCACCACCAC CCAGAAGGGT TCCATCACTT CCGTC 545 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1654 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 564 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1654 



TGTGTTTATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CCCGTGAGGG TAACGATCTG 100 

40 TACCACGAT^ TGCAGGAGAC TTCCGTTATT CAGCTTGATG GCGAGTCCAA 150 

GGTTGCCCTG GTTTTCGGTC AGATGAACGA GCCCCCTGGA GCTCGTGCCC 200 

GTGTCGCTCT TACCGGGTAA GTTGATAGAT AGTGCCTTCC CTTCCTGTTT 250 

CAACACCTCA ACACCACCAC TCCCCAAGAC ATTGCTACTC ATACTGCACC 300 

ATGATATTAT ATTTACGCCT CTTGGACGCT AGCTAATGTT GTATCGACAG 350 

45 TTTGACTGTT GCTGAGTACT TCAGAGACGA GGAGGGTCAG GACGTGCTGC 400 

TTTTCATTGA CAACATTTTC CGATTCACTC AGGCCGGTTC CGAGGTGTCT 450 

GCCCTTCTCG GTCGTATCCC CTCTGCCGTC GGTTACCAGC CCACTCTGGC 500 

CGTCGACATG GGTGGTATGC AGGAGCGTAT TACCACCACC ACCAAGGGTT 550 

CCATTACCTC AGTC 564 

50 



2) INFORMATION FOR SEQ ID NO: 1655 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1655 

CGTACCGCGC GTGTACGAGG CTCTTGAGGT ACAAAATGGT AGTGAGAATC 50 

TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATTGT TCGTACCATC 100 

GCCATGGGTT CTTCCGACGG TCTGCGTCGC GGTCTGGAAG TCAAAGACCT 150 

CGAGCATCCG ATCGAAGTCC CGGTAGGTAA AGCAACGCTG GGTCGTATCA 200 

TGAACGTACT GGGCCAACCG GTAGACATGA AAGGCGACAT CGGCGAAGAA 250 

GAGCGTTGGG CGATTCACCG CGCAGCGCCT TCCTACGAAG AGTTGTCAAA 300 

CTCTCAGGAA CTGCTGGAAA CCGGCATCAA AGTTATCGAC CTGATGTGTC 350 

CGTTTGCGAA GGGCGGTAAA GTTGGTCTGT TCGGTGGTGC GGGTGTAGGT 400 

AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA TCGAGCACTC 4 50 

CGGTTACTCC GTGTTTGCGG GCGTAGGTGA ACGTACTCGT GAGGGTAACG 500 

ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGATAA AGTATCCCTG 550 

GTGTATGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC GCGTTGCGCT 600 

GACCGGCCTG ACCATGGCTG AGAAGTTCCG TGACGAAGGT CGTGACGTTC 650 

TGCTGTTCGT CGATAACATC TATCGTTACA CCCTGGCCGG TACTGAAGTA 700 

TCCGCACTGC TGGGTCGTAT GCCTTCAGCG GTAGGTTACC AGCCGACTCT 750 

GGCGGAAGAG ATGGGCGTTC TGCAGG 77 6 



2) INFORMATION FOR SEQ ID NO: 1656 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1656 

GTGTTCATCC AGGAGTTGAT TGTAAGTGAT TATATTCCCC TAGAAAGAAA 50 
TGTTTTGAAC AAAAGTCTCG AATTAGAAAA TTCTTTTCAG ATACTAATTT 100 
ACTATAGAAC AACATTGCCA AGGCTCACGG TGGTTACTCC GTCTTCACCG 150 

855 
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GTGTCGGAGA GCGAACCCGT GAAGGAAACG 
GAAACTCGTG TCATCCAACT TGATGGCGAG 
CGGTCAGATG AACGAGCCCC CAGGTGCCCG 
GTTTGACCAT TGCTGAGTAC TTCCGTGATG 
TTCTTTAAAT TAGATATCTT CTGGAGAAAC 
AGTGCTTCTC TTCATCGACA ACATCTTCCG 
AAGTGTCTGC CCTGCTTGGT CGTATTCCAT 
ACTCTTGCCG TCGACATGGG TGGTATGCAG 
GAAGGGATCC ATTACCTCCG TC 



PCT/CAOO/01150 

ATCTGTACCA TGAAATGCAG 200 

TCCAAGGTCG CCCTGGTCTT 250 

TGCCCGTGTT GCTCTTACTG 300 

AGGAAGGTCA AGACGGTATG 350 

AGCGTCTAAC AAATTCTTCC 4 00 

TTTCACTCAG GCTGGTTCCG 450 

CTGCCGTCGG TTACCAACCC 500 

GAACGTATTA CCACCACCAA 550 

572 



2) INFORMATION FOR SEQ ID NO: 1657 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1657 

GCCTAAAGTG TATAACGCAC TTGAGGTGAA AGGCGGTGCC ACTAAACTGG 50 

TACTGGAAGT TCAGCAGCAG CTAGGCGGCG GCGTTGTACG CTGTATCGCT 100 

ATGGGTACTT CTGACGGTCT GCGTCGCGGA CTGGACGTTG TTGACCTGGA 150 

GCACCCGATT GAAGTCCCAG TAGGTAAAGC GACCTTAGGC CGCATTATGA 200 

ACGTACTGGG TGAGCCAATT GATATGAAGG GTGATATCGG CGAAGAAGAT 250 

CGCTGGGCTA TTCACCGTGA AGCTCCAAGC TACGAAGAAC TGTCTAACTC 300 

GCT^GAACTG CTGGAAACCG GTATCAAGGT AATGGACTTG ATTTGTCCGT 350 

TCGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTTGGTAAA 400 

ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCAGG 4 50 

TTACTCTGTA TTTGCCGGCG TGGGTGAACG TACTCGTGAG GGTAACGACT 500 

TCTACCACGA AATGACCGAC TCCAACGTAT TGGACAAAGT ATCACTGGTT 550 

TATGGCCAGA TGAACGAGCC ACCAGGAAAC CGTCTGCGCG TTGCGCTGAC 600 

CGGTCTGACT ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT GACGTACTGC 650 

TGTTCATCGA TAACATCTAC CGTTATACCT TGGCCGGTAC CGAAGTATCT 700 

GCACTGCTGG GTCGTATGCC TTCTGCGGTA GGTTATCAGC CAACGCTGGC 750 

GGAAGAGATG GGTGTTCTGC AAGAACGTAT CACCTCTACC 7 90 



2) INFORMATION FOR SEQ ID NO: 1658 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

856 
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PCT/CAOO/01150 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1658 



10 



TGTCTTCATT CAGGAGCTTA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACTGGTGTG GGAGAGCGCA CTCGTGAGGG AAACGACTTG 100 

TATCACGAGA TGCAAGAGAC TTCCGTTATC CAGCTCGAAG GCGAATCCAA 150 

GGTTGCCCTC GTCTTCGGTC AAATGAACGA GCCTCCGGGT GCTCGTGCTC 200 

15 GTGTTGCTCT CACCGGTCGT AAGTGCTCCT TCCCAGATTT CTCTTCCCCA 250 

GTTTCTGGAC CCACTTTTTC CTTCCACCAC CATTCTACTG GGTAGGACCA 300 

AGATAGCACT GCCTATTCTG GTGCCTTCCT ACCGCCTACT CTACTGCCTA 350 

TTCCACCACC TTTTCTACCG CCTCTTCTAC TTGCTATTGT ATACTAACTT 4 00 

ACTCAAACAG TTACTATTGC TGAGTACTTC CGTGACGCTG AGGGCCAGGA 4 50 

20 TGTGCTTCTC TTCATCGACA ACATTTTCCG TTTCACCCAG GCCGGTTCCG 500 

AGGTGTCCGC TCTTCTCGGT CGTATCCCCT CCGCCGTCGG TTACCAGCCC 550 

ACCCTTGCCG TCGACATGGG TGGTATGCAG GAGCGTATCA CCACCACCAA 600 

GAAGGGATCC ATTACCTCCG TC 622 

25 

2) INFORMATION FOR SEQ ID NO: 1659 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1659 



GACGCTGTAC CTCAGGTGTA CGATGCACTG ACAGTTGAGG GTGCTGAGCT 50 

GGTACTGGAA GTGCAGCAGC AGCTGGGTGG TGGTGTTGTT CGCTGTATCG 100 

45 CGATGGGTGC CTCTGATGGC CTCAAGCGCG GTCTGAAAGC GCACAATACT 150 

GGTGCTCCTA TCACTGTACC GGTGGGTGTG GAAACACTGG GCCGGATCAT 200 

GGATGTGTTG GGTAACCCGA TTGACCAGAA AGGTCCAATC GGTGAACAAG 250 

ATCGCTGGGT GATCCACCGT GAAGCACCAA GCTACGAAGA TCAGGCTAAC 300 

AGCACTGAAC TGCTGGAAAC CGGTATCAAG GTTATCGACC TGGTATGCCC 350 

50 GTTTGCGAAA GGCGGTAAAG TCGGTCTGTT CGGTGGTGCC GGTGTAGGTA 400 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 4 50 

GGTTATTCCG TGTTTGCGGG CGTGGGTGAG CGTACCCGTG AAGGTAACGA 500 

CTTCTACCAC GAAATGACAG ACTCCAACGT ACTGGACA/^ GTATCCCTGG 550 

857 

SUBSTITUTE SHEET (RULE 26) 
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TGTACGGTCA GATGAACGAG CCGCCAGGTA 

ACCGGCCTGA CCATTGCGGA GAAATTCCGT 

GCTGTTCATC GATAACATCT ACCGTTATAC 

CGGCACTGCT GGGCCGTATG CCTTCTGCGG 

5 GCGGAAGAGA TGGGTGTACT GCAAGAGCGT 



PCT/CAOO/01150 

ACCGTCTGCG CGTAGCACTG 600 

GATGAAGGTC GTGACGTACT 650 

CCTGGCGGGG ACCGAAGTAT 700 

TAGGTTATCA GCCAACGCTG 750 

ATTACCTCTA CCCG 794 



2) INFORMATION FOR SEQ ID NO: 1660 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1660 

25 AGGTATATGA CGCTSTGAAG ATCACAGGTG AAGGCGCCTG TAATGGTTTG 50 

GTGCTGGAAG TTCAGCAACA GCTAGGCGGT GGTGTAGTTC GTACTATCGC 100 

TATGGGTTCT TCTGATGGTC TGCGTCGTGG TCTTGAGGTT GTTAACTCAG 150 

GTTCACCTAT TTCTGTTCCT GTTGGTACCG CCACGCTTGG CCGTATCATG 200 

AACGTATTAG GTGAGCCTAT TGATGAAGCG GGTCCAATCG GTGAAGAAGA 250 

30 GCGTTATGTT ATTCACCGTG CAGCACCTTC ATATGAAGAT CAATCGAACA 300 

CTACTGT^CT GTTAGAGACA GGTATCAAGG TTATTGACCT TGTTTGTCCA 350 

TTCGCTAAGG GTGGTAAAGT AGGTCTGTTC GGTGGTGCGG GTGTTGGTAA 400 

AACAGTTAAC ATGATGGAAC TGATTAACAA CATCGCTAAA GCTCACTCGG 450 

GTCTTTCGGT GTTCGCCGGT GTGGGTGAAC GTACTCGTGA AGGTAACGAC 500 

35 TTCTACTACG AGATGAAAGA TTCTGGCGTT CTCGACAAAG TGGCCATGGT 550 

TTATGGTCAG ATGAACGAGC CACCAGGAAA CCGTTTACGC GTAGCACTGT 600 

CAGGTCTGAC AATGGCTGAG AAGTTCCGTG ACGAAGGTCG TGACGTATTG 650 

TTGTTCGTTG ACAACATCTA CCGTTATACC TTAGCCGGTA CTGAAGTATC 700 

TGCACTGTTA GGCCGTATGC CTTCTGCGGT AGGTTATCAA CCAACATTGG 750 

4 0 CTGi^AGAAAT GGGCGTTCTG CAAGAGCGTA TTACTTCAAC TAAGACGGG 799 



45 



2) INFORMATION FOR SEQ ID NO: 1661 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



858 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1661 

TGGGAAGCGA AAATCCTG 18 



2) INFORMATION FOR SEQ ID NO: 1662 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 774 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ca/npyJoJbacter curvus 

(B) STRAIN: ATCC 35224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1662 

CTATGCCTCA AACTAGAGAG CATATCTTGC TATCTCGCCA AGTAGGCGTT 50 

CCATATATCG TTGTATTTAT GAACAAAGCC GATATGGTCG ATGACGCTGA 100 

GCTTCTTGAG CTAGTCGAGA TGGAAATTCG CGAGCTTCTT AACGAGTACA 150 

ACTTCCCTGG CGATGATACT CCTATCATAT CAGGTTCTGC TCTTAAAGCC 200 

CTCGAAGAGG CTA/W^GCAGG CGTTGATGGC GAGTGGTCAG CAAAAGTTCT 250 

TGAGCTTATG GATAAAGTCG ATGAGTATAT CCCAACTCCA GTTCGTGCTA 300 

CCGATAAAGA CTTCCTGATG CCTATCGAAG ACGTTTTCTC TATCTCAGGT 350 

CGTGGAACGG TCGTTACTGG TAGGATCGAA AAAGGTGTCG TAAAAGTTGG 4 00 

CGATACTATC GAGATCGTTG GTATCAAACC TACTCA/^CT ACGACAGTTA 4 50 

CTGGCGTTGA GATGTTTAGG AAAGAGATGG AACAAGGCGA GGCCGGTGAT 500 

AACGTAGGTG TTCTTTTAAG AGGTACTAAA AAAGAAGACG TCGAGCGCGG 550 

CATGGTTCTT TGTAAGCCAA AATCAATCAC TCCTCATACA AAATTTGAGG 600 

GTGAGGTTTA TATCCTAACA AAAGAGGAAG GCGGACGCCA CACTCCATTC 650 

TTTAACAACT ATAGACCACA ATTTTATGTA AGAACAACAG ACGTTACAGG 7 00 

TTCTATCACA CTTCCAGAAG GAACTGAGAT GGTTATGCCT GGAGATAATG 750 

TCAGAATTTC CGTTGAACTC ATCG 774 



2) INFORMATION FOR SEQ ID NO: 1663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter rectus 

859 
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(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1663 

5 TTCTGCGGCT GACGGCCCAA TGCCACAAAC TAGAGAGCAC ATCTTGCTAT 50 

CTCGCCAAGT AGGCGTTCCG TATATCGTTG TTTTTATGAA CAAAGCCGAT 100 

ATGGTCGATG ATGCCGAGCT TCTTGAGCTG GTTGAGATGG AGATTCGCGA 150 

GCTTCTAAAC GAGTATGATT TCCCTGGTGA CGATACTCCA ATCGTAGCAG 200 

GCTCTGCTCT TCAAGCTCTT AATGAAGCCA AAGCCGGAAC AGAAGGCGAG 250 

10 TGGTCTGCAA AAATTCTTGA GCTTATGGCT AAAGTTGACG AGTATATCCC 300 

GACTCCGGTT CGTGCAACGG ATAAAGACTT CTTGATGCCT ATTGAGGACG 350 

TTTTCTCTAT CTCCGGTCGC GGCACCGTCG TTACCGGCAG AATCGAAAAA 400 

GGTATCGTAA AAGTCGGTGA TACTATCGAG ATCGTAGGTA TCCGCGATAC 450 

TCAAACAACT ACCGTTACCG GCGTTGAGAT GTTCAGAAAA GAGATGGATC 500 

15 AAGGCGAAGC GGGCGATAAC GTAGGCGTTC TTCTAAGAGG CACTAAAAAA 550 

GAAGACGTTG AGCGCGGTAT GGTTCTTTGC AAACCTAAAT CAATCACTCC 600 

TCACACTAAA TTTGAGGGAG AGGTTTATAT CTTAACTAAA GAGGAAGGCG 650 

GACGCCATAC TCCATTCTTT AATAACTATA GACCGCAGTT TTATGTAAGA 700 

ACTACCGACG TTACCGGTTC TATCACTCTT CCGGAAGGAA CAGAGATGGT 750 

20 TATGCCTGGC GATAACTTAA AGATAAGCGT TGAGCTTATC G 791 



2) INFORMATION FOR SEQ ID NO: 1664 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1664 

40 CGACGGACAG ATGCCCCAGA CCAGGGAGCA CTTGCTCCTC GCCCGCCAGG 50 

TCGGTGTCAA GCGCATTGTC GTCTTCGTCA ACAAGGTCGA TGCCATTGAG 100 

GACAAGGAGA TGTTGGAGCT CGTCGAGATG GAGATGCGTG AGCTTCTCTC 150 

CAGCTACGGC TTCGAGGGTG ACGACACTCC CATCGTCATG GGTTCCGCCC 200 

TTTGCGCCAT TGAGGGCCGC GAGCCCGACA TTGGTGTCGA GAAGATTGAC 250 

45 GAGCTCCTCG AGCACGTCGA CACCTGGATC CCCACCCCCG AGCGTGACAT 300 

CGCCAAGCCT TTCCTCATGT CCGTTGAGGA CGTCTTCTCC ATTCCCGGCC 350 

GTGGTACCGT CGCTTCTGGC CGTGTCGAGC GTGGTGTCCT GAAGAAGGAT 400 

TCCGAAGTCG AGCTTGTCGG CAAGAACAAG AACCCCATCA AGACCAAGGT 4 50 

TACCGACATC GAGACCTTCA AGAAGTCTTG CGACGAGTCC CGCGCTGGTG 500 

50 ACAACTCCGG TCTCCTTCTC CGTGGTGTCA AGCGTGACGA TGTCCTCCGT 550 

GGCATGGTCG TTGTCCAGCC CGGCACCACC AAGGCCCACA AGAAGTTCCT 600 

TGCCTCCATG TACGTCCTCA CCAAGGAGGA GGGTGGCCGC CACACTGGTT 650 

TCGCCAACAA CTACAAGCCC CAGATGTTCA TCCGTACCGC CGATGAGGCC 700 

860 
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GCCACTCTTA CCTGGCCCGA GGGTACCGAG GAGGACAAGA TGGTCATGCC 750 
CGGTGACAAT GTCGAGATGA TCTGCGAGAT CCACAAGCCC ATTGCCGTCG 800 
AGCAAGGCCA 810 



2) INFORMATION FOR SEQ ID NO: 1665 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1665 
CAGTACAGGT AGACTTCTG 19 

20 



2) INFORMATION FOR SEQ ID NO: 1666 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 
35 (B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1666 

ATGATTGCGA AACCTACTTG CTGTGGAAGA ATTTGGATAT TCTAACATTT 50 

40 CTCTAGGCCT CAAACCAGAG AGCATCTGCT CCTTGCCCGC CAGGTCGGTG 100 

TTCAGAAGCT CGTCGTTTTC GTTAACAAGG TCGACGCTGT TGAGGACCCA 150 

GAGATGTTGG AACTTGTCGA GCTAGAGATG CGTGAGCTGC TCAGCCACTA 200 

TGGTTTCGAG GGTGAGGAGA CCCCAATCAT TTTTGGCTCT GCTCTCTGCG 250 

CCCTTGAATC TCGACGACCA GAATTGGGTG TTGAGAAGAT CGATGAGCTA 300 

45 TTGAACGCTG TGGATACCTG GATTCCCACC CCAGAGCGTG CCACTGATAA 3 50 

GCCTTTCCTT ATGTCCATTG AGGAAGTTTT CTCCATCTCT GGTCGTGGTA 400 

CCGTCGTTTC CGGTCGTGTC GAGCGTGGTA TCCTCAAGAA GGACTCTGAT 4 50 

GTCGAAATTG TGGGTGGATC TGATACACCC ATCAAGACGA AGGTCACCGA 500 

CATTGAAACC TTCAAGAAGT CTTGTGACGA ATCCCGAGCT GGTGACAACT 550 

50 CCGGTCTACT TCTCCGAGGT GTCAAGCGTG AGGACTTGAG ACGTGGAATG 600 

GTTGTTGCTG CTCCCGGATC GACCAAGGCT CATACCGACT TCATGGTCTC 650 

CCTTTATGTT CTGACCGAGG CTGAGGGTGG CCGTTCCAAT GGATTCACCC 7 00 

ACAAGTACCG CCCACAGATG TTCATCCGTA CTGCCGGTAT GTATU^CCCTT 750 

861 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

TTTCTACCAT TCACTTTGTT TCACCACTGA 
CGAAGCCGCA TCTTTCAGCT GGCCTGGAGA 
TGCCTGGTGA CAATGTCGAG ATGATTTGCA 



PCT/CAOO/01150 

CTTGTATACT TTACCGCAGA 800 
GGATCAAGAC AAGAAGGCCA 850 
AGACCCTC 888 



2) INFORMATION FOR SEQ ID NO: 1667 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortax 

(B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1667 

ATGCCGCAGA CCCGCGAGCA CTTGCTGCTC GCCCGTCAGG TCGGTGTCCA 50 

GAAGATCGTT GTTTTCGTCA ACAAGGTTGA TGCTATCGAC GACCCGGAGA 100 

TGCTGGAGCT TGTCGAGATG GAGATGCGTG AACTTCTCAG CACATACGGT 150 

TTCGAGGGTG ACGAGACCCC TGTTATTATG GGCTCCGCGC TCATGGCTCT 200 

CAACAACCAG CGCCCCGAGA TTGGTCAACA GAAGATTGAT GAACTCATGG 250 

CCGCTGTCGA CGAGTGGATC CCTACTCCCC AGCGTGACCT CGACAAGCCT 300 

TTCCTGATGT CTGTTGAGGA TGTCTTCTCC ATTGCTGGCC GTGGTACCGT 350 

TGTGTCCGGC CGTGTGGAGC GCGGTACCCT CAAGCGTGAT GAGGAAGTCG . 400 

AGCTTGTCGG CAAGGGTGTC GACCCCATCA AGACCAAGGT CACCGATATC 4 50 

GAGACTTTCA AGAAGTCCTG CGAGGAGGCT CAGGCTGGTG ACAACTCTGG 500 

TCTTCTGATC CGTGGTGTCC GCCGCGAGGA TGTTCGTCGC GGTATGGTTG 550 

TCTCCAAGCC CGGCACCGTC AAGTCTCACA CTCAGTTCCT GGCCTCGCTT 600 

TACGTTCTCA CCAAGGAGGA GGGTGGTCGC CACACTGGTT TCGGCGAGCA 650 

CTACCGTCCC CAGCTCTACC TCCGTACCTC AGACGAGTCT GTCGATCTGA 700 

CCTTCCCCGA GGGAACTGAG GATCACCACT CCAAGATCGT CATGCCTGGT 7 50 

GACAACATCG AGATGGTCGT CACGATGACT CACGCCAACG CTA 793 



2) INFORMATION FOR SEQ ID NO: 1668 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

862 
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(B) STRAIN: K-12 KL1699 

(C) ACCESSION NUMBER: JOl 

(xi) SEQUENCE DESCRIPTION: SEQ 

AACATGATCA CCGGTGCTGC GCAGATGGAC 
TGCGACTGAC GGCCCGATGC CGCAGACTCG 
GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 
GTTGATGACG AAGAGCTGCT GGAACTGGTT 
TCTGTCTCAG TACGACTTCC CGGGCGACGA 
CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 
GAACTGGCTG GCTTCCTGGA TTCTTACATT 
TGACAAGCCG TTCCTGCTGC CGATCGAAGA 
GTGGTACCGT TGTTACCGGT CGTGTAGAAC 
GAAGAAGTTG AAATCGTTGG TATCAAAGAG 
TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 
ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 
CAGGTACTGG CTAAGCCGGG CACCATCAAG 
TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 
TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 
ACCATCGAAC TGCCGGAAGG CGTAGAGATG 
CAAAATGGTT GTTACCCTGA TCCACCCGAT 
GTTTCGCAAT CCGTGAAGGC GGCCGTACCG 



ID NO: 1668 



GGCGCGATCC 


TGGTAGTTGC 


50 


TGAGCACATC 


CTGCTGGGTC 


100 


TCCTGAACAA 


ATGCGACATG 


150 


GAAATGGAAG 


TTCGTGAACT 


200 


CACTCCGATC 


GTTCGTGGTT 


250 


AGTGGGAA.GC 


GAAAATCCTG 


300 


CCGGAACCAG 


AGCGTGCGAT 


350 


CGTATTCTCC 


ATCTCCGGTC 


400 


GCGGTATCAT 


CAAAGTTGGT 


450 


ACTCAGAAGT 


CTACCTGTAC 


500 


CGAAGGCCGT 


GCTGGTGAGA 


550 


GTGAAGAAAT 


CGAACGTGGT 


600 


CCGCACACCA 


AGTTCGAATC 


650 


CGGCCGTCAT 


ACTCCGTTCT 


700 


GTACTACTGA 


CGTGACTGGT 


750 


GTAATGCCGG 


GCGACAACAT 


800 


CGCGATGGAC 


GACGGTCTGC 


850 


TTGGCGCGGG 


C 


891 



2) INFORMATION FOR SEQ ID NO: 1669 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasiformis 

(B) STEIAIN: ATCC 60625 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1669 

TCCTCGTGGT CGCCGCCACC GACGGCCCGA TGCCGCAGAC CAAGGAGCAC 50 

GTGCTCCTGG CCCGCCAGGT CGGCGTTCCG TACATCGTCG TCGCCCTCAA 100 

CAAGGCCGAC ATGGTGGACG ACGAGGAGAT CCTGGAGCTC GTCGAGCTCG 150 

AGGTCCGTGA GCTCCTCTCC GAGTACGAGT TCCCGGGCGA CGACGTTCCC 200 

GTCGTCAAGG TCTCCGCTCT GAAGGCCCTC GAGGGCGACA AGGAGTGGGG 250 

CAACTCGGTT CTCGAGCTCA TGAACGCCGT CGACACGGCG ATCCCCGAGC 300 

CCGAGCGTGA CGTCGACAAG CCGTTCCTCA TGCCGATCGA GGACGTCTTC 350 

ACGATCACCG GTCGCGGTAC GGTCGTCACC GGCCGTATCG AGCGTGGTGT 400 

CCTGAAGGTC AACGAGACCG TCGACATCAT CGGCATCAAG ACCGAGAAGA 450 

CCACCACCAC GGTCACCGGC ATCGAGATGT TCCGGAAGCT CCTCGACGAG 500 

GGCCAGGCCG GTGAGAACGT CGGTCTGCTC CTCCGTGGCA TCAAGCGCGA 550 

863 
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GGACGTCGAG CGCGGCCAGG 
ACACGGAGTT CGAGGCGCAG 
CGCCACACGC CGTTCTTCAA 
GACGGACGTG ACCGGCGTGG 
5 TGCCGGGTGA CAACACCGAG 
ATGGA 



TCATCATCAA GCCGGGCTCG 
GCCTACATCC TGTCCAAGGA 
CAACTACCGC CCGCAGTTCT 
TGACCCTCCC CGAGGGCACC 
ATGAAGGTGG AGCTCATCCA 



PCT/CAOO/01150 

GTCACGCCGC 600 
CGAGGGTGGC 650 
ACTTCCGTAC 700 
GAGATGGTCA 750 
GCCCGTCGCC 800 

805 



10 2) INFORMATION FOR SEQ ID NO: 1670 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 935 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1670 



GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACA GATGAAAGGA 50 

TTTGACGTTT CTAACATCAG TCTAGGCCTC AGACCAGAGA ACATTTGCTC 100 

CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG TTAACAAGGT 150 

CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTGTCGAA CTTGAAATGC 200 

30 GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC CCCCATCATT 250 

TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG AGCTTGGTGT 300 

CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG ATCCCCACCC 350 

CAGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA GGAAGTGTTC 400 

TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG AGCGTGGTAT 4 50 

35 CCTCAAGAAG GATTCGGACG TCGAAATTGT TGGTGGCTCT ACCACCCCTA 500 

TCAAGACCAA GGTCACCGAT ATCGAAACCT TCAAGAAGTC CTGCGATGAA 550 

TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA TCAAGCGTGA 600 

GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC ACCAAGGCTC 650 

ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC TGAGGGTGGT 700 

40 CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT TCATCCGTAC 750 

TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG ATCATTGCTA 800 

ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC TGGCCTGGAG 850 

AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA GATGATTTGC 900 

AAGACCCTCC ACCCCATTGC TGCCGAGGCT GGCCA 935 

45 



2) INFORMATION FOR SEQ ID NO: 1671 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

864 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



10 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter aerogenes 

(B) STRAIN: ATCC 13048 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1671 



ACGATGCCCT TGAGGTACAG AATGGTAATG AGAGCCTGGT GCTGGAAGTT 50 

CAGCAGCAGC TCGGCGGTGG CGTAGTCCGT GCTATCGCCA TGGGTTCTTC 100 

CGACGGTCTG CGTCGTGGTC TGGAAGTTAA AGACCTTGAG CACCCGATCG 150 

AAGTCCCGGT AGGTAAAGCG ACTCTGGGCC GTATCATGAA CGTCCTGGGT 200 

15 CAGCCGATCG ACATGAAAGG CGACATCGGC GAAGAAGAAC GTTGGGCTAT 250 

CCACCGCGCG GCGCCTTCCT ATGAAGAGCT GTCCAGCTCT CAGGAACTGC 300 

TGGAAACCGG CATCAAAGTT ATCGACTTGA TGTGTCCGTT CGCTAAGGGC 350 

GGTAAAGTTG GTCTGTTCGG TGGTGCGGGT GTAGGTAAAA CCGTAAACAT 400 

GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT TACTCCGTGT 4 50 

2 0 TTGCGGGCGT TGGTGAGCGT ACTCGTGAGG GTAACGACTT CTATCACGAA 500 

ATGACCGACT CCAACGTTCT GGATAAAGTA TCCCTGGTTT ACGGCCAGAT 550 

GAACGAGCCG CCGGGAAACC GTCTGCGCGT TGCGCTGACC GGCCTGACCA 600 

TGGCTGAGAA ATTCCGTGAC GAAGGTCGTG ACGTTCTGCT GTTCGTCGAT 650 

AACATCTATC GTTACACCCT GGCCGGTACT GT^GTATCTG CACTGCTGGG 700 

25 CCGTATGCCT TCAGCGGTAG GTTATCAGCC GACTCTGGCG GAAGAGATGG 7 50 

GCGTTCTGCA GGAACGTATC AC 772 



30 2) INFORMATION FOR SEQ ID NO: 1672 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: BordetBlla pertussis 

(B) STRAIN: Tohama 1 

(C) ACCESSION NUMBER: Genome project 

45' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1672 

ATGAGCAACG GAACCATCGT TCAGTGCATC GGCGCCGTGG TGGATATTCA 50 

GTTCCCCCGC GATAACATGC CCAAGATCTA CGAAGCGCTC ACCCTGGTCG 100 

ACGAGGGTTC CTCGTTCGCC GAGAAGGGCT TGACGCTGGA AGTGCAACAA 150 

50 CAGCTGGGCG ACGGCGTGGT GCGTACCATC GCGCTGGGTT CCAGCGACGG 200 

GCTGCGCCGC GGCATGCAAG TGGCCGGCAC CGGCGCACCG ATCTCGGTGC 250 

CCGTGGGCCA CGGCACCCTG GGCCGCATCA TGGACGTGCT GGGCCGTCCC 3 00 

ATCGACGAAG CCGGTCCCAT CGCCTCCGAC GAGAAGCGCG CCATCCACCA 350 
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GCCCGCGCCC CGTTTCGACG AGCTGTCGCC GTCGGTCGAG CTGCTGGAAA 400 

CCGGCATCAA GGTTATCGAC CTGGTGTGCC CGTTCGCCAA GGGCGGCAAG 450 

GTCGGCCTGT TCGGCGGCGC CGGCGTGGGC AAGACCGTCA ACATGATGGA 500 

ACTGATCAAC AACATCGCCA AGCAGCACAG CGGCTTGTCG GTGTTCGCCG 550 

5 GCGTGGGCGA GCGTACCCGC GAAGGCAACG ACTTCTACCA CGA/ATGGAA 600 

GAGTCGAACG TTCTGGACAA GGTGGCCATG GTGTTCGGCC AGATGAACGA 650 

GCCCCCGGGC AACCGCCTGC . GCGTGGCGCT GACCGGCCTG ACCATGGCCG 70 0 

AGAAGTTCCG CGACGAAGGC CGTGACATCC TGTTCTTCGT CGACAACATC 750 

TACCGCTACA CCCTGGCCGG TACCGAAGTG TCGGCGCTGC TGGGCCGTAT 800 

10 GCCGTCGGCG GTGGGCTACC AGCCTACGCT GGCCGAGGAA ATGGGCGTGC 850 

TGCAAGAGCG CATCACCTCG ACCAAGACCG GTTCGATCAC CTCGATCCAG 900 

GCCGTGTACG TGCCTGCCGA CGACTTGACC GACCCGTCGC CCGCCACGAC 950 

CTTCCAGCAC TTGGACTCGA CCGTCGTGCT GTCGCGTGAC ATCGCTGCGC 1000 

TGGGCATCTA TCCCGCCGTG GACCCGCTGG ATTCCTCCAG CCGCCAGCTC 1050 

15 GACCCGCAAG TCGTGGGCGA AGAGCACTAC CAGGTGGCCC GTGGCGTGCA 1100 

GCAGACGCTG CAGCGCTACA AGGAACTGCG CGACATCATC GCGATTCTGG 1150 

GCATGGACGA ACTGTCGCCG GAAGACAAGC AGGCCGTGGC CCGCGCGCGC 12 00 

AAGATCCAGC GCTTCCTGTC GCAGCCCTTC TACGTGGCCG AAGTGTTCAC 1250 

CGGCTCGCCG GGCAAGTACG TGTCGCTGGC CGAAACGATC CGTGGCTTCA 13 00 

2 0 AGATGATCGT CGACGGCGAG TGCGACGCGC TGCCCGAGCA GGCGTTCTAC 13 50 

ATGGTCGGCA CGATCGACGA GGCCTTCGAG AAGGCCAAGA AACTCCAATA 1400 

A 1401 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1673 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 97 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE:* Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 

(B) STRAIN: ATCC 9345 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1673 

CAGCTACCGA CGGTCCAATG GCTCAGACCC GCGAGCACGT TCTTCTTGCT 50 

CGCCAGGTTG GCGTTCCACA GATCATCGTT GCTCTCAACA AGGCTGACAT 100 

GGTTGACGAC GAGGAAATCC TCGAACTCGT CGAAATGGAA GTTCGTGAGC 150 

45 TTCTCTCTTC CCAGGAGTAC CCAGGTGACG ACCTCCCAGT CGTCAAGATC 200 

TCGGCACTCA AGGCTCTCGA AGGCGATGCC GAATGGAGCA AGGCAATCGA 250 

AGATCTCATG GAAGCTGTCG ATACCTACTT CGACGATCCA GTGCGTGACC 300 

TCGATAAGCC ATTCCTCATG CCAATCGAAG ACGTCTTCAC CATCACCGGT 3 50 

CGTGGCACCG TTGTTACCGG CCGTGCAGAG CGCGGTATGC TCAACTTGAA 400 

50 CGAAGAAGTT GAAATCCTCG GTATCCGTGC ACCACAGAAG ACAACCGTTA 450 

CCGGTATCGA AATGTTCCAC AAGTCGATGG ATCACGCAGA TGCAGGCGAA 500 

AACTGTGGTC TTCTCCTCCG TGGCACCAAG CGCGAAGATG TTGAACGTGG 550 

TCAGGTTGTT GCCAAGCCAG GCACCATCAC CCCACACACC AACTTCGAAG 600 
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CTCAGGTCTA CGTGCTCGGT AAGGAAGAAG 
TTCTCCAACT ACCGTCCACA GTTCTACTTC 
CGTGATCACC CTTCCAGAGG GCACCGAAAT 
CCGACATGAC AGTTGAGCTC ATCCAGCCAA 



GTGGCCGTCA CAACCCATTC 650 

CGTACCACGG ATGTTACCGG 700 

GGTTATGCCA GGCGACAACA 750 

TCGCTATGGA AGAGGGC 797 



2) INFORMATION FOR SEQ ID NO: 1674 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Butyrivibrio fibrisolvens 
20 (B) STRAIN: ATCC 19171 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1674 

CTGATGGTCC TATGCCACAG ACCCGTGAGC ACATCCTATT AGCACGTCAG 50 

25 GTAGGCGTAC CATACATCAT CGTATTCCTA AACAAGTGCG ATATGGTTGA 100 

CGACGAGGAA TTATTAGAGT TAGTTGAGAT GGACGTACGT GATCTATTAA 150 

ATCAGTACCA GTTCCCAGGC GACGACACTC CAATCATCCG TGGTTCAGCA 200 

CTAGGTGCAT TAAACGGCGA AGAGAAGTGG AAAGAGGCAA TCTATCAGTT 250 

AGCAGACACT CTAGATTCAT ACATTCCAGA GCCAAAGCGT GATATCGATG 300 

30 ATCCATTCCT ATTACCAATC GAAGATATCT TCTCAATCTC AGGTCGTGGT 350 

ACTGTAGTAA CCGGCCGTGT AGAGCGTGGT ATTGTACACG TAGGTGACGA 400 

AGTTGAAATC GTTGGTATTC GTCCAACCAC CAAGACCACT GTAACTGGCG 450 

TTGAAATGTT CCGTAAGTTA CTAGACGAAG GTCGTGCAGG TGATAACGTT 500 

GGTGTTCTAC TACGTGGTAC CAAGCGTGAT GAGGTTGAGC GTGGTCAGGT 550 

35 TCTAGCTGCT CCAGGCACAA TCACTCCACA CACCAAGTTC ACTGGTCAGG 60 0 

TTTACGTACT AAGCAAGGAT GAAGGTGGTC GTCACACTCC ATTCTTCAAG 650 

GGCTACCGTC CACAGTTCTT CTTCCGTACA ACCGATATTA CCGGTTCTAT 700 

CGATCTGAAA GAGGGCGTAG AGATGGTAAT GCCAGGTGAT AACACCGACA 750 

TGACCGTAAC CCTAATCCAC CCAGTAGCTA TGGCT 785 

40 



2) INFORMATION FOR SEQ ID NO: 1675 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN: ATCC 4 934 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1675 

5 

TGGGGCGATC TTGGTTGTTT CTGCTGCAGA TGGTCCTATG CCACAAACTA 50 

GAGAGCACAT TCTTCTTTCT CGTCAAGTAG GCGTTCCATA TATTGTTGTT 100 

TTTATGAATA AAGCAGATAT GGTTGATGAT GCTGAGCTTT TAGAGTTAGT 150 

TGAAATGGAA ATTAGAGAAT TATTAAGCTC TTATGATTTC CCAGGCGATG 200 

10 ATACACCTAT TATTTCTGGT TCTGCTTTAA AAGCTCTTGA AGAAGCTAAA 250 

GCTGGACAAG ATGGTGAATG GTCAGCAAAA ATTATGGATC TTATGGCTGC 300 

AGTTGATAGT TATATTCCAA CTCCAACTCG TGATACTGAA AAAGACTTCT 350 

TGATGCCAAT TGAAGACGTT TTCTCAATTT CAGGTCGTGG TACTGTTGTT 400 

ACAGGTAGAA TTGAAAAAGG TGTTGTAAAA GTAGGTGATA CTATCGAAAT 450 

15 CGTTGGTATT AAAGACACTC AAACAACAAC TGTAACAGGT GTTGAAATGT 500 

TCAGAAAAGA AATGGATCAA GGCGAGGCAG GAGATAACGT AGGTGTTCTT 550 

CTTCGTGGTA CTAAAAAAGA AGAAGTTATT CGCGGTATGG TTCTTGCTAA 600 

ACCAAi^ATCA ATTACTCCAC ACACTGACTT CGAAGCTGAA GTTTATATCT 650 

TAAATAAAGA TGAAGGTGGT AGACATACTC CATTCTTTAA CAACTATAGA 70 0 

20 CCACAGTTTT ATGTAAGAAC AACTGATGTT ACAGGTTCGA TTAAATTAGC 75 0 

TGATGGTGTT GAAATGGTTA TGCCAGGTGA AAATGTGAGA ATTACTGTAA 800 

GCTTGATCGC TCCAGTAGCA CTTGAAGAAG GAACT 835 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1676 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1676 

TTCTGCAGCA GACGGCCCTA TGCCACAAAC TAGAGAGCAT ATCTTACTTT 50 

CTCGTCAAGT AGGTGTACCA TATATTGTTG TTTTCATGAA CAAAGCTGAT 100 

ATGGTTGATG ATGCAGAATT ATTAGAATTA GTTGAAATGG AAATTAGAGA 150 

45 ATTACTAAGC TCTTATGATT TCCCAGGAGA TGATACTCCA ATTATTTCAG 2 00 

GTTCAGCATT ACAAGCTCTT GAAGAAGCAA AAGCTGGTCA AGATGGTGAA 250 

TGGTCTAAAA AAATCTTAGA TCTTATGGCT GCAGTTGATG ATTATATTCC 300 

AACTCCGGCT CGTGATACAG ATAAAGATTT CTTGATGCCA ATCGAAGATG 3 50 

TTTTCTCAAT CTCAGGTCGT GGAACTGTTG TTACCGGTAG AATTGAAAAA 4 00 

50 GGTGTTGTAA AAGTTGGTGA TACTATAGAA ATCGTTGGTA TTAGAGACAC 450 

TCAAACAACC ACAGTTACTG GTGTTGAAAT GTTTAGAAAA GAAATGGATC 500 

AAGGTGAAGC TGGTGATAAT GTTGGTGTAT TACTTCGTGG AACTAAAAAA 550 

GAAGATGTTG AACGTGGTAT GGTTCTTGCT AAACCAAAAT CAATCACTCC 600 
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ACATACTGAT TTTGAAGCAG AAGTTTATAT 
GTCGTCATAC TCCATTCTTT AATAATTATA 
ACAACTGATG TTACAGGTGC TATTAAACTT 
TATGCCAGGC GATAATGATA GAATTACTGT 
5 CACTTGAGGA AG 



PCT/CAOO/01150 

CTTAAATAAA GATGAAGGTG 650 
GACCGCAATT TTATGTAAGA 700 
GCAGAAGGCG TTGAGATGGT 750 
AAGTCTTATT GCTCCAGTTG 800 

812 



2) INFORMATION FOR SEQ ID NO: 1677 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 0 (A) ORGANISM: Campylohacter sputorum subsp. sputorum 

(B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1677 

25 GCTATTCTTG TATGTTCAGC TGCAGATGGT CCAATGCCAC AGACTAGAGA 50 

GCATATTCTA CTATCAAGAC AAGTTGGTGT TCCATACATA GTTGTTTTCT 100 

TAAATAAAGA AGATATGGTT GATGATGCTG AGCTTATAGA GTTGGTTGAA 150 

GTTGAGGTTA GAGATTTATT AAATGAATAT GATTTCCCTG GAGATGATAC 2 00 

TCCAATCGTA ATAGGTTCTG CTCTTAAAGC TTTAGAAGAA GCAAAAGCTG 250 

3 0. GAACAGAGGG TGAATGGTCT GCTAAAATTA TGAAACTTAT GGATGCTGTT 300 

GATAGCTATA TCCCAACTCC AACAAGAGAT ACAGATAAAG ATTTCCTTAT 3 50 

GCCAATCGAA GATATCTTCT CAATTTCTGG TCGTGGTACA GTTGTAACAG 4 00 

GTAGAATTGA AAAAGGTGTA GTAAAAGTTG GCGAGACTAT TGAGATAGTT 4 50 

GGTATTAGAC CTACTCAAAC AACAACAGTT ACTGGTGTTG AAATGTTTAG 500 

35 AAAAGAGCTA GATGAAGGTG AAGCTGGAGA TAATGTTGGT ATCTTGTTAA 550 

GAGGTACAAA AT^AAGAAGAT GTTGAAAGAG GTATGGTTTT ATGTAAACCA 600 

AAATCAATCA CTCCTCACAA GAAATTTGAA GGCGAAGTTT ATATTCTTAC 650 

AAAAGATGAA GGTGGTAGAC ATACTCCTTT CTTTAGTAAC TATAGACCAC 700 

AATTTTATGT TAGAACAACA GATGTAACAG GTTCTATATC TCTTCCTGAG 750 

40 GGAACAGAGA TGGATATGCC TGGTGATAAT GTAAAACTTA CAGTTGAACT 800 

TATAAACCCA ATTGCTCTTG AGCAAGGA 828 



45 2) INFORMATION FOR SEQ ID NO: 1678 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter upsaliensis 

(B) STRAIN: ATCC 49815 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1678 

CGATTTTGGT TGTTTCTGCT GCTGATGGTC CTATGCCACA AACTAGAGAG 50 

CACATTTTGC TTTCTCGTCA AGTGGGTGTT CCTTATATCG TAGTTTTTAT 100 

10 GAATAAGGCT GATATGGTTG ATGATGCAGA GCTTTTAGAA TTGGTTGAAA 150 

TGGAAATTAG AGAACTTTTA AGCTCTTATG ATTTCCCGGG CGATGACACT 200 

CCTATCATTT CAGGCTCTGC TCTTCAAGCC TTAGAAGAGG CTAAGGCGGG 250 

ACAAGATGGC GAGTGGTCAG CTAAGATTTT AGAGCTTATG AAGGCAGTTG 300 

ATGAGTATAT CCCAACTCCT GTTCGCGATA CTGAAAAAGA TTTCTTGATG 350 

15 CCTATTGAAG ATGTTTTTTC AATTTCTGGT CGTGGAACTG TTGTAACAGG 400 

TAGAATTGAA AAAGGTGTGG TTAAAGTCGG CGATACTATT GAGATAGTAG 450 

GTATCAAAGA TACTCAAACT ACAACAGTTA CAGGCGTTGA GATGTTTAGA 500 

AAAGAAATGG ATCAAGGTGA GGCTGGCGAT AATGTCGGTG TGCTTTTAAG 550 

AGGAACAAAA AAAGAAGATG TTCTTCGTGG TATGGTTCTT GCAAAGCCTA 600 

2 0 AATCTATCAC TCCTCATACT GATTTTGAAG CAGAAGTTTA TATTCTAAAT 650 

AAAGATGAGG GCGGTCGCCA TACTCCTTTC TTTAACAATT ATCGTCCGCA 700 

GTTTTATGTA AGAACGACTG ATGTAACTGG TTCTATTAAA TTAGCTGATG 750 

GTGTTGAGAT GGTTATGCCG GGTGAAAATG TAAGAATTAC AGTTAGCCTT 800 

ATCGCTCCAG TTGCACTTGA 820 

25 



2) INFORMATION FOR SEQ ID NO: 1679 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 
40 (B) STRAIN: ATCC 51173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1679 

CTGCAGCTGA TGGTCCAATG CCTCAAACAC GTGAACATAT CTTATTATCA 50 

45 CGTCAAGTAG GTGTTCCTTA CATGGTTGTC TTCTTAAACA AAGTTGACAT 10 0 

GGTTGACGAT GAAGAATTAT TAGAATTAGT TGAAATGGAA GTTCGTGATT 150 

TATTATCTGA ATACGAATTC CCTGGAGACG ACGTTCCAGT AATCGCTGGT 200 

TCAGCTTTAA AAGCTTTAGA AGGCGAAGAA CAATATGAAG CAAAAGTATT 250 

AGAATTAATG GAAGCTGTAG ATACATACAT TCCAGAACCA GTTCGTGATA 300 

50 CTGAAAAACC ATTCATGATG CCAGTTGAAG ATGTGTTCTC AATCACAGGT 350 

CGTGGTACAG TTGCTACTGG ACGTGTTGAA CGTGGACAAG TTAAAGTTGG 400 

TGACGAAGTT GAAATCGTTG GTATTGCTGA AGAAATTAGC AAAACAACTG 450 

TAACTGGTGT TGAAATGTTC CGTAAATTAT TAGATTACGC TGAAGCTGGA 500 
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GATAACATTG GTGCGTTATT ACGTGGTGTT 
TGGTCAAGTA TTAGCAAAAC CAGGAACAAT 
AGGCGGAAGT TTACGTATTA TCAAAAGAAG 
TTCTTCGCTA ACTACCGTCC TCAATTCTAC 
5 AGGTGTTGTA GAATTACCAG AAGGTACAGA 
ACGTATCAAT GACAGTTGAA TTAATTCACC 



ACACGTGAAC AAATCCAACG 550 

TACACCTCAT ACTAAATTCG 600 

AAGGTGGACG TCATACTCCA 650 

TTCCGTACAA CTGACGTTAC 700 

AATGGTAATG CCTGGAGATA 750 

CAA 783 



10 2) INFORMATION FOR SEQ ID NO: 1680 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1680 



GCTATCTTAG TTGTTGCTGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

ACACATTTTG CTTGCTCGTC AAGTTGGTGT TAACTACATC GTAGTATTCT 100 

TGAACAAGTG CGATTTAGTT GACGACCCAG AATTGATCGA CTTGGTTGAA 150 

ATGGAAGTTC GTGACTTGTT GACTGAATAC GATTACCCTG GTGATGATAT 200 

3 0 TCCAGTTGTT CGTGGTTCAG CATTAAAGGC TTTACAAGGT GACAAGGAAG 250 

CTCAAGACCA AATCATGAAG TTGATGGACA TTGTTGATGA ATACATCCCA 300 

ACTCCAGAAC GTCAAACTGA CAAGCCATTC TTGATGCCAG TTGAAGACGT 350 

ATTCACTATC ACTGGTCGTG GTACTGTTGC TTCAGGTCGT ATCGACCGTG 4 00 

GTACTGTTAA GGTCGGTGAC GAAGTTGAAA TCGTTGGTTT GGTAGATAAA 450 

35 GTTCTTAAGT CAGTTGTTAC TGGTTTGGAA ATGTTCCACA AGACTTTGGA 500 

CTTAGGTGAA GCCGGCGATA ACGTTGGTGT ATTGCTTCGT GGTGTTGACC 550 

GTGATCAAGT TGTTCGTGGT CAAGTATTGG CTGCACCCGG CTCAATCCAA 600 

ACTCATAAGA AGTTTAAGGC ACAAGTTTAT GTTTTGAAGA AGGACGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCTCAGACTA CCGTCCACAA TTCTACTTCC 700 

40 ACACCACTGA TATTACTGGT GAAATTGAAT TGCCAGAAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACAC TGAATTCACT GTTACTTTGA TCAAGCCAGC 800 

TGCCATCGAA AAGGGTACTA AGT 823 



45 

2) INFORMATION FOR SEQ ID NO: 1681 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 93 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



871 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leuconostoc mesenteroides subsp. 
5 dext rani cum 

(B) STRAIN: ATCC 19255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1681 

10 GCTGCAACTG ATGGTCCTAT GCCACAAACA CGTGAACACA TCTTGTTGGC 50 

ACGTCAAGTT GGTGTTGACT ACTTGGTTGT CTTCTTGAAC AAGACTGATT 100 

TGGTTGATGA TGAAGAATTA GTTGAATTGG TTGAAATGGA AGTTCGTGAA 150 

TTGTTGTCAG AATATGACTT CCCAGGTGAC GATATTCCTG TACTTAAGGG 200 

TTCAGCTTTG AAGGCTTTGG AAGGTGATCC TGAACAAGTT AAGGTTATCG 250 

15 AAGAATTGAT GGATACTGTT GATTCATACA TTCCAGAACC AGCACGTGAA 3 00 

ACAGACAAGC CATTCTTGAT GCCTGTCGAA GACGTCTTCA CAATCACTGG 3 50 

TCGTGGTACA GTTGCTTCTG GTCGTGTTGA CCGTGGTGTA TTGACTACAG 400 

GAACTGAAAT TGAAATCGTT GGTTTGAAGG ACGAAGTTCA AAAGACTACT 450 

GTTACAGGTA TCGAAATGTT CCGTAAGACT TTGGAAGAAG CTCAAGCGGG 500 

2 0 TGATAACATT GGTGCATTGT TGCGTGGTGT TGATCGTAGC AACATTGAAC 550 

GTGGTCAAGT TTTGGCAAAG CCAGGTTCAA TTAAGACACA CAAGAAGTTC 600 

AAGGCTGAAG TTTATGTCTT GACAAAGGAA GAAGGTGGTC GTCATACACC 650 

ATTCTTCACT AACTACCGTC CACAATTCTA CTTCCACACA ACTGATGTTA 700 

CAGGTGTTGT TGAATTGCCA GCCGGTGTTG AAATGGTTAT GCCTGGTGAC 750 

25 CAAGTGACAT TCGAAATCGA ATTGATCTCA CCAGTTGCCA TCG 7 93 



30 



2) INFORMATION FOR SEQ ID NO: 1682 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1682 

45 TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 2 00 

TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

50 GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 3 00 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 3 50 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 400 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 
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GTCTGTTGTA ACAGGCGTTG 
AAGCAGGTGA TAACGTAGGA 
GTAAAGCGTG GTATGGTTGT 
TCACTTCAAG GCTTCCATCT 
5 ATACTCCATT CGGAAACAAG 
GACTGTACAG GTGAGATCAC 
TGGTGACAAC GTAGAGATTG 



AAATGTTCCG TAAGATTCTT 
CTTTTGCTCC GCGGTATCGA 
TGTACACCCA GGTGCCATCA 
ATGTATTGAA GAAGGAAGAG 
TATCGTCCTC AGTTCTATCT 
TTTGCCAGAA GGCGTAGAGA 
AGGTTACCTT GATTTACAAG 



PCT/CAOO/01150 

TCTGAAGGTG 500 

TT^GGATGAA 550 

CTCCTCACGA 600 

GGTGGACGTC 650 

CCGTACAATG 700 

TGGTGATGCC 750 

GTTGCC 796 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1683 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 800 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ruminococcus bromil 

(B) STRAIN: ATCC 27255 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1683 

GGTTGCTGCT ACTGACGGCC CGATGCCTCA GACTCGTGAG CACGTTCTGC 50 

TCGCTCGTCA GGTGGGTGTG CCCGCCATCG TCGTCGCCCT CAACAAGTGC 100 

GACATGGTTG ACGATGAGGA GCTCATTGAG CTTGTCGAGA TGGAGGTCCG 150 

30 CGAGCTGCTG ACCTCGCAGG AGTTCGACGG CGACAACTGC CCTGTCGTTC 2 00 

GCATCTCCGC CTTCCAGGCC CTCCAGGGTG ACGAGAAGTG GACCCAGTCG 250 

ATCCTCGACC TCATGGACGC CGTGGACGAG TACATCCCGC AGCCTGAGCG 3 00 

CGATCTCGAC AAGCCCTTCC TTATGCCGAT CGAGGACGTC TTCACCATCA 350 

CCGGCCGTGG CACCGTTGTC ACCGGTCGTG TCGAGCGTGG TGTCGTCAAG 400 

35 ACTGGCGAAG AGGTCGAGAT CGTCGGTATC CACGAGAAGA CCCAGAAGAC 450 

CACCGTTACC GGTGTCGAGA TGTTCCGTAA GATCCTCGAC GAGGGCCGCG 500 

CTGGTGAGAA CGTCGGCGTT CTGCTCCGTG GCACCAAGAA GGAGGACGTC 550 

GTTCGCGGCA TGGTCCTCTC CAAGCCTGGT TCCACCACCC CCCACACCGA 600 

CTTCGAGGGC CAGGTCTACG TCCTCAAGAA GGATGAGGGT GGCCGCCACA 650 

40 AGCCGTTCTT CTCCCATTAC AGCCCCCAGT TCTACTTCCG TACCACGGAC 700 

GTGACTGGCA CTGTTGAGCT CCCCGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACACC GACATGACTG TGCACCTGAT TCACCCGGTT GCCATGGAGG 800 



45 

2) INFORMATION FOR SEQ ID NO: 1684 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 545 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides hrasiliensis 
5 (B) STRAIN: ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1684 

TGTCTTTATT CAGGAACTGA TTGTATGTTT CTTCTCGTTT ATATATAACA 50 

10 TACCTTCTAT ATTTCATGTG TTTCTAACGA ACTCATAGAA CAACATTGCC 100 

AAGGCTCACG GTGGTTACTC CGTGTTCACC GGTGTCGGTG AGCGTACCCG 150 

TGAAGGAAAC GATCTGTACC ACGAAATGCA GGAAACCCGC GTCATCCAGC 200 

TGGACGGCGA GTCCAAAGTC GCCCTCGTCT TCGGCCAGAT GAACGAGCCC 250 

CCCGGAGCCC GTGCCCGTGT TGCCCTGACC GGTCTGACCA TCGCTGAATA 300 

15 CTTCCGTGAC GAAGAAGGCC AAGATGGTAC GTTCCCCCAT TCCATATATG 350 

TTTCTTGTGC GCTTTGCCAA CTAAACACCA CCTAGTGCTC CTCTTCATCG 4 00 

ACAATATCTT CCGCTTCACC CAAGCCGGTT CCGAAGTGTC CGCCCTGCTA 450 

GGCCGCATCC CCTCCGCCGT CGGCTATCAA CCCACCCTCG CCGTCGACAT 500 

GGGTGGTATG CAGGAGCGCA TCACAACTAC AACAAAAGGC TCCAT 545 

20 



2) INFORMATION FOR SEQ ID NO: 1685 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 
35 (B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1685 

GATATCGCTT TATGGAAATT CGAAACTCCA AAATTCCACG TTACYGTTAT 50 

40 CGATGCTCCA GGTCACAGAG ATTTCATCAA GAATATGATT ACYGGTACCT 100 

CCCAAGCTGA TTGTGCTATT TTAATCATTG CTGGTGGTGT TGGTGAATTC 150 

GAAGCTGGTA TCTCCAAAGA TGGTCAAACC AGAGAACACG CTTTGTTAGC 200 

TTTCACCTTA GGTGTYAAAC AATTGATTGT TGCCGTTAAC AAAATGGACT 250 

CTGTCAAATG GGATCAATCC CGTTTCGAAG AAATCGTCAA GGAAGCTTCC 300 

45 GGTTTCATCA AGAAAGTTGG TTACAACCCA AAGACTGTTC CATTCGTTCC 350 

AATCTCTGGT TGGAATGGTG ACAACATGAT TGAAGTYTCW GCTAACGCYC 400 

CATGGTACAA AGGTTGGGAA AAGGAAACCA AAGCYGGTGT CGTTAAAGGT 450 

AAAACTTTAT TAGAAGCCAT TGATGCTATT GAACCACCTT CAAGACCAAC 500 

TGAAAAACCA TTGAGATTGC CATTGCAAGA TGTCTACAAG ATTGGTGGTA 550 

50 TCGGAACCGT ACCAGTCGGT ARAGTTGAAA CCGGTGTCAT TAAACCAGGT 600 

ATGATTGTTA CTTTCGCCCC AGCCGGTGTT ACTACTGAAG TCAAATCTGT 650 

TGAAATGCAT CACGAACAAT TAGAAGCTGG TTACCCAGGT GACAATGTTG 700 

GTTTCAACGT CAAGAATGTT TCAGTTAAAG T^AATCAGAAG AGGTAAHGTT 750 

874 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

GCTGGTGACT CCAAGAACGA 
TCAAGTTATT GTCTTGAACC 
CAGTTTTGGA TTGTCACACT 
TTGGAAAAGA TTGACAGAAG 
5 ATTCATCAAA TCTGGTGACG 
CATTRTGTGT TGAAGCTTTC 



TCCACCAAAA GGTGCTGAAT 
ATCCAGGTCA AATCTYTGCT 
GCCCACATTG CTTGTAAATT 
ATCCGGTAAG AAATTGGAAG 
CTGCTAWTGT CAAATTTGTT 



PCT/CAOO/01150 

CTTTCAACGC 800 
GGTTACTYTC 850 
CGATGAAATC 900 
AAAATCCAAA 950 
CCATWTAAAC 1000 
1020 



10 2) INFORMATION FOR SEQ ID NO: 1686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 929 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1686 



TGCTGCTTCC GATGGTCAAA TGTACGATTG ATATTCCTTC CAGCCAGTCA 50 

GGATAACAGC TGATACCAGT TGCAAATAGG CCCCAGACTC GTGAGCACTT 100 

GTTGCTTGCC CGTCAGGTTG GTGTCCAGAA GATCGTTGTC TTCGTCAACA 150 

• AGGTTGACGC TGTCGATGAC CCTGAGATGT TGGAGCTTGT TGAGCTCGAG 200 

30 ATGCGTGAGC TCCTCAACAC TTACGGTTTC GAGGGAGAGG AGACCCCTAT 250 

CATCTTCGGT TCCGCCCTGT GCGCTCTCGA AGGCCGCCGC GAGGACATTG 300 

GTACTCAGCG TATTGACTCC CTCCTCGAGG CCGTTGACAC TTGGATCCCT 350 

ACCCCCCAGC GTGACTTGGA CAAGCCCTTC CTGATGTCCA TTGAGGAAGT 4 00 

TTTCTCCATT GGTGGTCGTG GTACCGTCGC CTCTGGTCGT GTCGAGCGTG 450 

35 GTCTCCTCAA GAAGGATACC GAAGTTGAAA TTCACGGTGC TGATGGTATT 500 

CTGAAGACCA AGGTCACCGA CATTGAGACC TTCAAGAAGA GCTGCGATGA 550 

GTCTCGTGCT GGTGACAACT CCGGTCTTCT CCTCCGTGGT ATCCGTCGTG 600 

AGGATGTTCG TCGTGGTATG GTCATCGCTG CCCCTGGCTC CATCAAGGCC 650 

TCCAAGAAGT TCATGGTCTC CATGTACGTC TTGACTGAGG CTGAAGGTGG 700 

40 CCGCAAGAAC GGCTTCGGTG CCT^CTACCG CCCCCAGGCT TTCATCCGCA 750 

CTGCTGGTAA GTTTCGAACT ATTTGATTCA TTGATCACGT CCCTAACTGT 800 

TACTTTAGAC GAGGCTTGCG ACCTTCATTT CCCTGATGAG GCCGACAAGG 8 50 

ACCGCCACGT CATGCCCGGT GACAACGTCG AAATGGTCCT CAACCTCAAC 900 

AACCCCGTTG CTGCTGAGGC TGGACAGCG 92 9 

45 



2) INFORMATION FOR SEQ ID NO: 1687 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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PCT/CAOO/01150 



10 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 

(B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1687 



TGCCGCTTCC GATGGTCAGA TGTACGCTCA AGCCCCAGTT TCCATATAAA 50 

CATAAACGAT CTATCATCAG CACAACGCTG ACTTCTTCGC TTCCAGGCCC 100 

CAGACCCGTG AGCACTTGCT GTTGGCCCGT CAGGTCGGTG TCCAGAAGAT 150 

CGTGGTCTTC GTCAACAAGG TCGATGCCGT TGATGACCCG GAGATGTTGG 200 

15 AGCTCGTTGA GCTGGAAATG CGCGAGCTCC TGACCAGCTA CGGATTCGAG 250 

GGTGAAGAGA CCCCCATCAT CTTCGGTTCT GCTCTCTGCG CTCTTGAGGG 300 

CCGCCGTCCT GAGATTGGTA CTGAGAAGAT TGACGAGCTG ATGCACGCCG 350 

TCGACACCTG GATCCCCACC CCCCAGCGTG ACCTCGACAA GCCCTTCCTG 400 

ATGTCCGTCG AGGAAGTCTT CTCCATTGCT GGTCGTGGTA CCGTCGCTTC 450 

2 0 CGGCCGTGTC GAGCGTGGTA TTCTGAAGAA GGATAGCGAA GTCGAGATCA 500 

TCGGTGGTGC TTTCGACGCC ACGAAGACCA AGGTCACTGA CATCGAGACC 550 

TTCAAGAAGT CTTGCGACGA GTCTCGCGCT GGTGACAACT CTGGTCTCCT 600 

CCTCCGTGGT ATCCGTCGTG AGGATGTTCG GCGTGGTATG GTCATTGCTG 650 

CTCCTGGCAG CACCAAGGCC CACGACAAGT TCCTTGTCTC TATGTACGTC 700 

25 CTCACTGAGG CTGAGGGTGG CCGTCGTACC GGATTCGGTA CCAACTACCG 7 50 

CCCCCAGGTC TTCATCCGTA CTGCCGGTAA GTGTTCCTGG AAGAGGCTTT 800 

GAGCCTATAT AGGATCTCGG ATAATTTACT AATCCACCAT ATAGATGAGG 850 

CCGCTGACCT CAGCTTCCCC GACAACGATG ACTCCCGCCG TGTCATGCCC 900 

GGTGACAACG TTGAGATGGT CCTGAAGACC CACCGCCCCG TGGCTGCTGA 950 

30 G 951 



35 



2) INFORMATION FOR SEQ ID NO: 1688 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 5 (A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1688 

50 CGTTGCCGCT ACCGATGGTC AAATGCCTCA AACTAGAGAA CATTTGCTAT 50 

TGGCTAGACA GGTTGGTGTT CAACACATTG TCGTGTTTGT TAACAAGGTT 100 

GACACTATTG ATGATCCAGA AATGTTGGAA TTGGTTGAAA TGGAAATGAG 150 

AGAGTTGATT GCCACTTATG GTTTCGATGG TGATAACACC CCAGTTATCA 200 
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TGGGTTCTGC 
CAATCAATCG 
AACTAGAGAT 
CCATTTCTGG 
TTGAAGAAAG 
CAAAACCACC 
CTATGGCTGG 
GATATCAAGA 
CACCAAATTC 
GTCATTCAGG 
GGTGATGTCA 
GCAAGTCTTA 
CAACTGCTCT 



TCTATGTGCT 
ACAGATTGTT 
TTGGAAAAAC 
TAGAGGTACC 
GTGATGAAAT 
GTCACCGGTA 
TGATAACTGT 
GAGGTATGGT 
TTGGCCTCTT 
TTTTGCTGAA 
CTGTTACTTT 
CCAGGTGACA 
TGAAACCGGT 



TTGGAAGGTC GTGAACCTGA AATCGGTGCT 
GGAAGCCGTT GATGAATACA TTCCAACTCC 
CATTCTTGAT GGGTGTTGAA GATGTCTTCT 
GTCTGTACCG GTCGTGTTGA AAGAGGTAAC 
CGAAATTGTC GGCTACAACA AGACTCCAAT 
TTGAGATGTT CAAAAAGGAA TTAGACCAAG 
GGTATCTTAT TACGTGGTGT TAAGAGAGAT 
TATCTCTAAA GTCAACACCG TTTCCGCACA 
TATACGTCTT GACTAAAGAA GAAGGTGGTC 
AACTACAGAC CTCAATTGTT CATCAGAACC 
AACCTTCCCA GAAGATGCTG ATCACTCTCA 
ACGTTGAAAT GGAATGTACC TTGGTTCATC 
CAA 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
823 



2) INFORMATION FOR SEQ ID NO: 1689 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN: ATCC 20107 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1689 

GCTGCTACTG ACGGTCAAAT GCCTCAAACT AGGGAACATA TGTTGTTGGC 50 

GAGACAAGTT GGTATCCAAA ACTTGGTTGT TTTTGTTAAC AAAGTTGATA 100 

CCATTGATGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGGGAA 150 

TTATTGAGCT CTTATGGGTT TGATGGTGAA AACACTCCAG TTATCATGGG 200 

ATCAGCCTTG TGTGCTTTAG AAGGTAAACA ACCAGAAATC GGTGTTCAAG 250 

CCATTCAAAA ATTATTGGAT GCTGTTGATG AATATATTCC AACTCCAGAA 300 

AGAGATGCTG ACCAACCATT TTTGATGCCA GTGGAAGATG TGTTTTCTAT 350 

TTCAGGTAGA GGAACCGTTG TCACCGGAAG AGTTGAAAGA GGTATGTTGA 400 

AGAAAGGTGA AGAAGTAKAA GTCATTGGTG AAAACTCATT TAAGGCTACT 4 50 

TCCACGGGTA TTGAGATGTT CAAAAAGGAA TTGGATGCCG CTATGGCCGG 500 

TGACT^CTGT GGTATTTTGT TGAGAGGTGT CAAGAGAGAC GAAGTCAAGA 550 

GGGGTATGGT TTTGGCCAAA CCAGGTACCA CCACCCCACA CCAAAAGTTT 600 

TTGGCTTCCA TTTATATCTT GACTGCTGAA GAAGGTGGAC GTAGTACCCC 650 

TTTCAGTGAA GGATACAAAC CACAATGTTT CTTTAGAACT AGTGATGTTA 700 

CCACGACATT TACTTTCCCA GAAGGTGAAG GTGTTGACCA CTCACAAATG 750 

GTTATGCCAG GAGRCAATGT TGAAATGGTG GGAACTTTAA TCAAGAAAGC 800 

TCC 803 



2) INFORMATION FOR SEQ ID NO: 1690 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1690 
CAGGTCCTGT TGCGACTGAA GAA 



2) INFORMATION FOR SEQ ID NO: 1691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1691 
CACAGATAAA CCTGAGTGTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1692 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1692 
GGTGAGAACT GTGGTATCTT ACTT 



2) INFORMATION FOR SEQ ID NO: 1693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1693 
CATTTCAACG CCTTCTTTCA ACTG 



2) INFORMATION FOR SEQ ID NO: 1694 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1694 
AAGGCAAGGA TGACAACGGC 



2) INFORMATION FOR SEQ ID NO: 1695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1695 
ACGATTTCCA CTTCTTCCTG G 



2) INFORMATION FOR SEQ ID NO: 1696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1696 

879 
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ATGTTCCTGT AGTTGCTGGA 



2) INFORMATION FOR SEQ ID NO: 1697 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1697 
TTTCTTCAGC AATACCAACA AC 



2) INFORMATION FOR SEQ ID NO: 1698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1698 
GGAATCAACA GATGGTTTAC AAA 



2) INFORMATION FOR SEQ ID NO: 1699 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1699 
GCATCTTCTG GGAAAGGTGT 



880 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 1700 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1700 
AAGATGCGGA AAGAAGCGAA 20 



2) INFORMATION FOR SEQ ID NO: 1701 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1701 

3 0 ATTATGGATC AGTTCTTGGA TCA 23 



2) INFORMATION FOR SEQ ID NO: 1702 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: Challis V288 

(C) ACCESSION NUMBER: L20574 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1702 

TTCATAGACG CTGAGCACGC TTTGGATCCA TCTTACGCGG CTGCTCTAGG 50 
TGTAAATATT GATGAGCTGT TGCTATCTCA ACCAGATTCT GGTGAGCAAG 100 
GTTTAGAAAT TGCAGGAAAA TTGATTGACT CTGGGGCAGT TGATTTAGTT 150 
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GTCATCGACT CTGTTGCAGC TCTTGTACCA CGTGCGGAAA TCGATGGAGA 200 
•TATCGGTGAT AGC 213 



2) INFORMATION FOR SEQ ID NO: 1703 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 692 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: strain GS-5 

(C) ACCESSION NUMBER: M61897 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1703 



GGGCCGGAAT 
GGCGCAAAAA 

25 TTGATCCAGC 
CTTTCACAAC 
GATTGATTCT 
TAGTACCACG 
TTACAAGCAC 

3 0 CAATAAAACA 
TTGGTATTAT 
AAGTTTTATT 
AGGAACCGGG 
AAGTTGTTAA 

35 ATTATATATG 
CAGTGATTTG 



CTTCTGGTAA 
GATGGCGGTA 
CTATGCTGCT 
CAGATTCAGG 
GGCGCTGTTG 
TGCGGAGATT 
GCATGATGAG 
AAAACCATTG 
GTTTGGTAAT 
CTTCTGTGCG 
GAACAAAAAG 
AAATAAAGTT 
GTGAAGGCAT 
GGAATTATCC 



GACAACTGTC 
TTGCCGCTTT 
GCTCTTGGCG 
AGAACAGGGT 
ATTTAGTTGT 
GACGGAGATA 
TCAAGCGATG 
CTATTTTTAT 
CCAGAAACAA 
TCTTGATGTC 
ACAGCAATAT 
GCTCCACCAT 
TTCTCGTACA 
AAAAAGCTGG 



GCTCTTCATG 
CATTGATGCA 
TTAATATTGA 
CTTGAAATTG 
TGTTGACTCA 
TTGGTAATAG 
CGTAAATTAT 
TAATCAATTG 
CCCCTGGCGG 
CGCGGCAATA 
TGGTAAAGAG 
TTAAGGAAGC 
GGTGAATTAG 
AGCTTGGTAC 



CTGCTGCTCA 50 

GAACATGCCC 100 

TGAGCTTTTG 150 

CAGGGAAATT 200 

GTGGCAGCTT 250 

TCATGTTGGC 300 

CAGCTTCAAT 350 

CGGGAAAAAG 4 00 

GCGTGCCTTG 450 

CTCAAATTAA 500 

ACCAAAATTA 550 

TTTTGTAGAA 600 

TTAAGATTGC 650 

TC 692 



40 2) INFORMATION FOR SEQ ID NO: 1704 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniaB 

(C) ACCESSION NUMBER: Z17307 
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ATGGCG7L?U^ 
AGAACGTGAA 
5 TTGGTAAAGG 
CAAGTGATGA 
TGGTTATCCT 
GTAAGACAAC 
GGGATTGCTG 

10 TGCGGCCCTT 
CAGGAGAGCA 
GTTGATCTTG 
AATTGATGGA 
TGAGCCAGGC 

15 ATTGCCATTT 
AAATCCAGAA 
TCCGCTTGGA 
AAAGAAACCA 
GGTAGCTCCA 

2 0 GAATTTCTAA 
ATCAAAAAAG 
AGGTTCTGAG 
ATGAAATTGA 
GT^GTTTCAG 

25 AGAAGCAGTG 
TCGAAATTGA 
TCGA 



AACCAAAAAA 
AAGGCCTTGA 
ATCAATCATG 
GCTCAGGTTC 
AAGGGACGTA 
GGTTGCCCTT 
CCTTTATCGA 
GGTGTCAATA 
AGGTCTTGAG 
TCGTAGTCGA 
GATATCGGAG 
CATGCGTAAA 
TTATCAACCA 
ACAACACCGG 
TGTTCGTGGT 
ATGTCGGTAA 
CCGTTTAAGG 
GACTGGTGAG 
CAGGGGCTTG 
AATGCTAAGA 
TAAGCAAGTC 
AACAAGATAC 
AATGAAGAAG 
AGAATAAGCT 



ATTAGAAGAA 
ATGACGCTCT 
CGTTTGGGTG 
TTTAGCTCTT 
TCATCGAAAT 
CATGCAGTTG 
TGCGGAACAT 
TTGACGAATT 
ATTGCGGGAA 
CTCAGTTGCT 
ATAGCCATGT 
CTTGGCGCCT 
ATTGCGTGAA 
GCGGACGTGC 
AATACACAAA 
AGAAACTAAG 
AAGCCGTAGT 
CTTTTGAAGA 
GTATTCTTAC 
AATACTTGGC 
CGTTCTAAAT 
TGAAAACAAA 
TTCCGCTTGA 
GTTAAAGCAG 



ATTTCAAAAA 
TAAATTGATT 
AACGTGCGGA 
GACATTGCCC 
CTATGGCCCA 
CACAAGCGCA 
GCCCTTGATC 
GCTCTTGTCT 
AATTGATTGA 
GCCCTTGTTC 
TGGTTTGCAG 
CTATCAATAA 
AAAGTTGGAG 
TTTGAAATTC 
TTAAGGGAAC 
ATTAAGGTTG 
TGAAATTATG 
TTGCAAGCGA 
AAAGATGAAA 
AGAGCACCCA 
TTGGCTTGAT 
ATU^GATGAGC 
CTTAGGCGAT 
TGGAGAAATC 



AATTTGGGGC 50 

GAGAAAGACT 100 

GCAA/y^GGTG 150 

TTGGCTCAGG 200 

GAGTCATCTG 250 

AAAAGAAGGT 300 

CAGCTTATGC 350 

CAACCAGACT 400 

CTCAGGTGCA 4 50 

CTCGTGCGGA 500 

GCTCGTATGA 550 

AACCAAAACA 600 

TGATGTTTGG 650 

TATGCTTCAG 700 

TGGTGATCAA 750 

TAAAAAATAA 800 

TACGGAGAAG 8 50 

TTTGGATATT 900 

AAATTGGGCA 950 

GAAATCTTTG 1000 

TGATGGAGAA 1050 

CAAAGAAAGA 1100 

GAACTTGAAA 1150 

CGCTACTTTT 1200 
1204 



30. 



2) INFORMATION FOR SEQ ID NO: 1705 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: NZ131 

(C) ACCESSION NUMBER: U21934 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1705 



ATGCGTTCAG GAAGTCTAGC 

TCCTAAAGGA CGTATCATCG 

50 CGACTGTGGC TTTACATGCT 

GCAGCCTTTA TCGATGCCGA 

GCTTGGGGTT AATATTGATG 

AACAAGGACT TGAAATTGCA 



TCTTGATATT 
AAATCTATGG 
GTAGCACAAG 
GCATGCGCTT 
AACTTCTCTT 
GGTAAATTGA 



GCTTGGATAG 
TCCAGAGTCT 
CTCAAAAAGA 
GATCCAGCTT 
GTCTCAACCA 
TTGATTCTGG 



CTGGTGGTTA 50 

TCCGGTAAAA 100 

AGGTGGAATC 150 

ATGCTGCTGC 200 

GATTCTGGAG 250 

TGCGGTTGAC 300 
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CTGGTTGTTG 
TGGTGATATT 
AGGCCATGCG 
ATCTTTATCA 
TGAAACAACA 
TGGATGTGCG 
GCCAGCATTG 
TCCGCCATTT 
CTCGTACAGG 
AAAGCAGGTG 
TGAAAATGCT 
TCGACCGTAA 
GAATCTGCTA 
TTTAGATAAT 



TCGATTCAGT 
GGCGATAGCC 
TAAATTATCA 
ACCAATTGCG 
CCAGGTGGTC 
TGGAAACAAC 
GTAAGGAGAC 
AAGGTAGCAG 
GGAGCTTGTG 
CTTGGTTCTC 
AAGCGTTATT 
AGTACGTGTT 
TGGCAGTAGC 
GGTATTGAAA 



AGCAGCTTTA 
ATGTCGGATT 
GCTTCTATTA 
TGAAAAAGTT 
GAGCTTTGAA 
CAAATTAAAG 
CAAAATCAAG 
AAGTTGAAAT 
AAAATTGCTT 
TTATAATGGT 
TGGCCGATCA 
AAATTTGGTT 
ATCAGAAGAA 
TTGAAGATTA 



GTGCCACGTG 
GCAAGCACGT 
ATAAAACAAA 
GGTGTGATGT 
ATTCTATGCT 
GAACTGGTGA 
GTTGTTAAAA 
CATGTATGGG 
CTGATTTGGA 
GAGAAGATTG 
TCCACAATTG 
TGCTTGAAGA 
ACCGATGATC 
A 



CTGAAATTGA 350 

ATGATGAGTC 4 00 

AACTATCGCA 450 

TTGGAAATCC 500 

TCTGTTCGGC 550 

CCAAAAGATA 600 

ACAAGGTCGC 650 

GAAGGTATTT 700 

CATTATCCAA 750 

GCCAAGGTTC 800 

TTTGATGAAA 850 

AAGCGAAGAA 900 

TTGCTTTAGA 950 

981 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius subsp, 
thermophilus 

(C) ACCESSION NUMBER: M94 062 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1706 



35 GCGTATGCAC GAGCTCTAGG 

GCCTGATTCT GGTGAGCAAG 

CTGGTGCAGT GGATTTAGTT 

CGTGCAGAAA TTGATGGAGA 

GCGTATGATG AGTCAAGCCA 

40 CAAAAACGAT TGCTATCTTT 

ATGTTTGGTA AC 



TGTTAATATC 
GTCTCGAAAT 
GTTGTTGACT 
TAGTGGTGAC 
TGCGTAAACT 
ATTAACCAGT 



GATGAGCTTC 
TGCAGGTAAG 
CAGTTGCGGC 
AGTCATGTAG 
TTCTGCATCT 
TGCGTGAA7U\ 



TTTTGTCGCA 
CTGATTGACT 
CTTCGTACCA 
GACTTCAAGC 
ATTAATAAAA 
AGTTGGTATC 



50 
100 
150 
200 
250 
300 
312 



45 2) INFORMATION FOR SEQ ID NO: 1707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

884 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: J02967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1707 



10 



15 



20 



ATGAAAAACA 
TATTATCTAC 
CTCCATTATT 
ACAAACGCTG 
GGCACCAGAT 
CGGAAATAAT 
GGAATGGAGA 
TTCTGTTGTT 
AAATCAAGAA 
GGAGATAAAG 
CTTAAAAATT 
ATCACAATCT 
ATGTATCTAC 
TGCAGGATTC 
TTATTATAAG 
GGAAACTTGG 
GATCACCATT 



CAATACATAT 
AGCAGCGCCA 
TGAAGGAACT 
AAATTGCTCA 
TCAACTTTCA 
AGATCAGAAA 
TCTGGAACAG 
TGGGTTTCGC 
TTATCTCAAA 
AAAGAAACAA 
TCACCAGAAG 
CCCAGTTAAA 
AAGATCTGGA 
ACAGCAAATA 
CAAATCAGGA 
GGTCGAATTT 
CTAAACACAC 



CAACTTCGCT 
GTGCATCAAC 
GAAGGTTGTT 
ATTCAATAAA 
AGATCGCATT 
ACCATATTCA 
CAATCATACA 
AAGAAATAAC 
GATTTTGATT 
CGGATTAACA 
AACAAATTCA 
AACTCAGCCA 
TAATAGTACA 
GAACCTTACA 
CATAAATATG 
AACATCAAGC 
TAAATTTATA 



ATTTTTTTAA 
AGATATCTCT 
TTTTACTTTA 
GCAAAGTGTG 
ATCACTTATG 
AATGGGATAA 
CCAAAGACGT 
CCAAAAAATT 
ATGGAAATCA 
GAAGCATGGC 
ATTCCTGCGT 
TAGAAAACAC 
AAACTGTATG 
AAACGGATGG 
TTTTTGTGTC 
ATAAAAGCCA 
A 



TAATTGCAAA 50 

ACTGTTGCAT 100 

CGATGCATCC 150 

CAACGCAAAT 200 

GCATTTGATG 250 

AACCCCCAAA 300 

GGATGCAATT 350 

AGATTAAATA 400 

AGACTTCTCT 450 

TCGAAAGTAG 500 

AAAATTATTA 550 

CATAGAGAAC 600 

GGAAAACTGG 650 

TTTGAAGGGT 700 

CGCACTTACA 750 

AGAAAAATGC 800 

831 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1708 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: HH22 

(C) ACCESSION NUMBER: M60253 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1708 



45 TTGAAAAAGT 
TAATTCAAAC 
ATAATGCTCA 
GTAAAATTTA 
GATAAATAGT 

50 AAAAAGTACA 
GAAAAATATG 
AATGACATAT 
GTGGAATCAA 



TAATATTTTT 
AGTTCACATG 
TATTGGTGTT 
ATTCAGATAA 
GCTATTTTGT 
TATTAACAAA 
TAGGAAAAGA 
AGTGATAATA 
AAAAGTTAAA 



AATTGTAATT 
CCAAAGAGTT 
TATGCTTTAG 
GAGATTTGCC 
TAGAACAAGT 
GATGATATAG 
TATCACTTTA 
CAGCAAACAA 
CAACGTCTAA 



GCTTTAGTTT 
AAATGATTTA 
ATACTAAAAG 
TATGCTTCAA 
ACCTTATAAT 
TTGCTTATTC 
AAAGCACTTA 
TAAAATTATA 
AAG2\ACTAGG 



TAAGTGCATG 50 

GAAAAAAAAT 100 

TGGT7\AGGAA 150 

CTTCAAAAGC 200 

AAGTTAAATA 250 

TCCTATTTTA 300 

TTGAGGCTTC 350 

AAAGAAATCG 400 

AGATAAAGTA 450 



885 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



ACAAATCCAG 
CAAAAAAGAT 
TTATCGCAAA 
TTAATGTTAA 
AAAAGACTAT 
CTAGAAATGA 
GTTTTAGTCA 
TAAGTTGATA 



TTAGATATGA 
ACTTCAACAC 
TGGAAAATTA 
ATAATAAAAG 
AAGGTTGCTG 
TGTTGCTTTT 
TTTTTACGAA 
AGTGAAACCG 



GATAGAATTA 
CTGCTGCTTT 
AGCAAAGAAA 
CGGAGATACT 
ATAAAAGTGG 
GTTTATCCTA 
TAAAGACAAT 
CCAAGAGTGT 



AATTACTATT 
CGGTAAGACT 
ACAAAAAATT 
TTAATTAAAG 
TCAAGCAATA 
AGGGCCAATC 
AAAAGTGATA 
AATGAAGGAA 



CACCAAAGAG 500 

TTAAATAAAC 550 

CTTACTTGAT 600 

ACGGTGTTCC 650 

ACATATGCTT 700 

TGAACCTATT 750 

AGCCAAATGA 800 

TTTTAA 846 



10 



2) INFORMATION FOR SEQ ID NO: 1709 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 555 bases 

(B) TYPE: Nucleic acid 

(C) STEIANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



25 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: M29695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1709 



30 



35 



ATGTCCGCGA 
CCTGCCGATG 
GGGGTGGCGA 
CTGCCCAGAG 
GGGCGAGGAA 
GTGATGGCTG 
CAGTCTCTGG 
TGTCCGCGCT 
TTCAGACCGA 
AAGGCAGGAT 
GGTTTACATG 
CCTAA 



GCACCCCCCC 
CTCCATGACT 
CGAAGAGCGA 
CGATGGCGGA 
CCGATCGGCT 
GTGGGAAGAT 
CTGACCCGAC 
CTCGTTGAAC 
CCCGACTCCG 
TCGTGCGGGA 
GTTCAAACAC 



CATAACTCTT 
GGCTCAACCG 
CCGACTCTTG 
AGAGTCCGTA 
ATGCTCAGTC 
GAAACTGATC 
ACAGTTGAAC 
TACTGTTCTC 
AACAACCATC 
GAAGATCATC 
GACAAGCCTT 



CGCCTCATGA 
GCCGCACATC 
ATGAAGTGCT 
ACACCGTACA 
GTACGTCGCG 
CAGGAGTGCG 
AAAGGCCTAG 
GGACCCCACC 
GAGCCATACG 
ACCACGCCTG 
CGAGAGAAAG 



CCGAGCGCGA 50 

GTTGAGTGGT 100 

GGAACACTAC 150 

TCGCAATGCT 200 

CTCGGAAGCG 250 

AGGAATAGAC 300 

GAACAAGGCT 350 

GTGACGAAGA 400 

CTGCTATGAG 450 

ACGGGCCGGC 500 

CGCGGTGTTG 550 

555 



40 



2) INFORMATION FOR SEQ ID NO: 1710 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: K02987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1710 



ATGAACCAGA 
GCATGTAAAA 
TAATAGAAAT 
ATGAGTCGAT 

10 GACTAAAGAA 
ATATTCTAAA 
AATATTCCTT 
AAGTCAGGCT 
GATTGCAAAA 

15 GATATAAAAA 
AAGTGTAGAC 
CAAAGAAGGA 
CGTGAATATC 
GCATGCAAAT 

20 CTATTTTCAA 



AAAACCCTAA 
GAAATATTGA 
CGGATCAGGA 
CAGTTACTGC 
GCGGTAAACC 
ATTTTCCTTC 
ATAACATCAG 
AAATATAGCT 
TCTGCAACGA 
TGCTCAAAAA 
TCTGTATTGA 
CTACAAAAAG 
GTGTTCTTTT 
GTCACTAATA 
TAGTTACAAA 



AGACACGCAA 
ATCACACGAA 
AAAGGACATT 
TATAGAAATT 
CCTCTGAGAA 
CCAAAACATA 
TACGGATATT 
ATCTTATCGT 
GCTTTGGGTT 
AGTACCACCA 
TTGTTCTTGA 
TATCGATCTT 
CACTAAAAAC 
TTAATAAACT 
TTGTTTCACT 



AATTTTATTA 
TATCAGTAAA 
TTACCAAAGA 
GATGGAGGCT 
TATAAAAGTG 
TAAACTATAA 
GTCAAAAGAA 
TGAGAAGGGA 
TACTATTAAT 
CTATATTTTC 
ACGACATCAA 
TTGTTTATAA 
CAATTCCGAC 
ATCGAAGGAA 
AA 



CTTCTAAAAA 50 

CT^GACAACG 100 

GCTAGTC/y^ 150 

TATGTCAAGT 200 

ATTCAAACGG 250 

GATATATGGT 300 

TTACCTTTGA 350 

TTTGCGAAAA 400 

GGTGGAGATG 450 

ATCCTAAGCC 500 

CCATTGATTT 550 

GTGGGTAAAC 600 

AGGCTTTGAA 650 

CAATTTCTTT 700 

732 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1711 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2570 

(C) ACCESSION NUMBER: M19270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1711 



40 



45 



50 



ATGAACAAAA 
ACTCAACCAA 
ACGAAATTGG 
AGTAAACAGG 
GTCAGAAAAA 
TTCTACAGTT 
ATTCCTTACC 
CCATGCGTCT 
CCTTGGATAT 
ATTCAGCAAT 
AGTAAACAGT 
ATAAATATTG 
GAATATCGTC 



ATAT7U\AATA 
ATAATAAAAC 
AACAGGTAAA 
TAACGTCTAT 
TTAAAATCGA 
TCAATTCCCT 
ATTTAAGCAC 
GACATCTATC 
TCACCGAACA 
TGCTTAAGCT 
GTCTTAATAA 
GAAGCTATAT 
AACTGTTTAC 



TTCTCAAAAC 
AATTGAATTT 
GGGCATTTAA 
TGAATTAGAC 
ATACTCGTGT 
AACAAACAGA 
ACAAATTATT 
TGATTGTTGA 
CTAGGGTTGC 
GCCAGCGGAA 
AACTTACCCG 
ACGTACTTTG 
TAAAAATCAG 



TTTTTAACGA 
AAAAGAAACC 
CGACGAAACT 
AGTCATCTAT 
CACTTTAATT 
GGTATAAAAT 
AAAAAAGTGG 
AGT^^GGATTC 
TCTTGCACAC 
TGCTTTCATC 
CCATACCACA 
TTTCAAAATG 
TTTCATCAAG 



GTGAAAAAGT 50 

GATACCGTTT 100 

GGCTAAAATA 150 

TCAACTTATC 200 

CACCAAGATA 250 

TGTTGGGAAT 300 

TTTTTGAAAG 350 

TACAAGCGTA 400 

TCAAGTCTCG 450 

CTAAACCAAG 500 

GATGTTCCAG 550 

GGTCAATCGA 600 

CAATGAAACA 650 
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CGCCAAAGTA AACAATTTAA GTACCGTTAC TTATGAGCAA 
TTTTTAATAG TTATCTATTA TTTAACGGGA GGAAATAA 



PCT/CAOO/01150 
GTATTGTCTA 



700 
738 



2 ) INFORMATION FOR SEQ ID NO: 1712 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN451 

(C) ACCESSION NUMBER: M17990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1712 



ATGAACGAGA 
TAATATAGAT 

25 TCTTTGAAAT 
AGGTGTAATT 
TACAGAAAAT 
ATATATTGCA 
AATATACCTT 

30 TAGTATAGCT 
GATTATTAAA 
GATATTTCTA 
TAGAGTGAAT 
CACACAAAGA 

35 AAAGAATACA 
ACATGCAGGA 
CTCTTTTCAA 



AAAATATAAA 
AAAATAATGA 
CGGCTCAGGA 
TCGTAACTGC 
AAACTTGTTG 
GTTTAAATTT 
ATAACATAAG 
GATGAGATTT 
TACAAAACGC 
TATTAAGTAT 
AGCTCACTTA 
TAAACAGAAG 
AGAAAATATT 
ATTGACGATT 
TAGCTATAAA 



ACACAGTCAA 
CAAATATAAG 
AAAGGGCATT 
CATTGAAATA 
ATCACGATAA 
CCTAAAAACC 
TACGGATATA 
ATTTAATCGT 
TCATTGGCAT 
GGTTCCAAGA 
TCAGATTAAA 
TATAATTATT 
TACAAAAAAT 
TAAACAATAT 
TTATTTAATA 



AACTTTATTA 
ATTAAATGAA 
TTACCCTTGA 
GACCATAAAT 
TTTCCAAGTT 
AATCCTATAA 
ATACGCAAAA 
GGAATACGGG 
TATTTTTAAT 
GAATATTTTC 
TAGAAAAAAA 
TCGTTATGAA 
CAATTTAACA 
TAGCTTTGAA 
AGTAA 



CTTCAAAACA 50 

CATGATAATA 100 

ATTAGTACAG 150 

TATGCAAAAC 200 

TTAAACAAGG 250 

AATATTTGGT 300 

TTGTTTTTGA 350 

TTTGCTAAAA 4 00 

GGCAGAAGTT 450 

ATCCTAAACC 500 

TCAAGAATAT 550 

ATGGGTTAAC 600 

ATTCCTTAAA 650 

CAATTCTTAT 7 00 

735 



40 

2) INFORMATION FOR SEQ ID NO: 1713 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 1029 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalls 

(B) STRAIN: V583 
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(C) ACCESSION NUMBER: U00456 . 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1713 



5 ATGAATAAAA 
TGATGTGTCG 
AAAAATTCGA 
CTATGCAAGA 
ATTCTCCCCG 

10 AATACGAAAC 
TGCGGGGAGG 
CTATGTAGGC 
TGGCCTACAT 
ATGATTGAAA 

15 CTTTGTGAAG 
ACAGTACGGA 
GGAAAAATCT 
GGTCATGGGA 
GGTTGAGCCA 

20 GGCTCAGAGA 
ACGAAATCGG 
GCAGAGGGCT 
GTTCTAAACG 
TCCACGCATG 

25 GCCTGATTAC 



TAAAAGTCGC 
GTAAAATCCG 
TCCGCACTAC 
AGCCATGTAC 
GATAGGAAAA 
TCGGCGTATT 
ATGGTGCGAT 
TGCGATATTC 
TCTTACAAAA 
AAGGTGACAA 
CCGGCACGGT 
AGAACTAAAC 
TAATTGAGCA 
AACGAGGATG 
CGGTATCTTC 
ATGCGATGAT 
GTGCAAGAAA 
TGCTCGTGTT 
AGGTCAATAC 
GCGGCTGCCG 
ATTGGCGATA 



AATTATCTTC 
CAATAGAAAT 
ATCGGAATTA 
GGAATGGGAA 
CGCATGGTCT 
GACGTGGCTT 
ACAGGGTCTG 
AAAGCTCCGC 
AATGCGGGCA 
ACCGGAGGCG 
CAGGTTCGTC 
GCTGCGATAG 
AGCGATTTCG 
ATTTGATTGT 
CGCATCCATC 
TATCGTTCCA 
CGGCAAAGAA 
GATCTTTTTT 
CCTGCCCGGT 
CAGGAATCAC 
GAGAGGTGA 



GGCGGTTGCT 
TGCTGCGAAC 
CAAAAAACGG 
GCCGATAGTC 
GCTTGTCATG 
TCCCGGTTTT 
TTTGAATTGT 
AGCTTGCATG 
TCGCCGTCCC 
AGGACGCTTA 
CTTTGGCGTA 
AAGCAGCAGG 
GGCTGTGAGG 
CGGCGAAGTG 
AGGAAAACGA 
GCAGACATTC 
AGTATATCGG 
TGCAGGAGGA 
TTTACATCGT 
GCTTCCCGCA 



CGGAGGAACA 50 

ATTAATACTG 100 

CGTATGGAAG 150 

TCCCCGCCAT 200 

AAAGAAAGAG 250 

GCATGGCAAA 300 

CTGGTATCCC 350 

GACAAATCAC 4 00 

CGAATTTCAA 4 50 

CCTACCCTGT 500 

ACCAAAGTAA 550 

ACAATATGAT 600 

TCGGCTGCGC 650 

GATCAAATCC 700 

GCCGGAAAAA 750 

CGGTCGAGGA 800 

GTGCTTGGAT 850 

TGGCGGCATC 900 

ACAGCCGCTA 950 

CTAATTGACA 1000 
1029 



2) INFORMATION FOR SEQ ID NO: 1714 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1714 

45 GCACAGGTTC AGTTGGGCTT GATCTTGCTT TAGGTATAGG CGGTGTTCCA 50 

AAAGGAAGAA TTATAGAAAT TTATGGRCCT GAAAGTTCAG GTAAAACCAC 100 

TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT GGRGTTTGTG 150 

CTTTTATCGA TGCAGAACAT GCRCTTGATG TRAAATATGC TAAAAATTTG 200 

GGTGTAAATA CAGATGATTT GTATGTTTCT CAGCCTGATT TTGGAGARCA 250 

50 AGCCTTAGAA ATTGTAGAAA CTATAGCAAG AAGTGGCGCA GTAGATCTTA 300 

TCGTAGTAGA TAGCGTTGCA GCGCTTACCC CAAAAGCAGA AATTGAAGGC 350 

GATATGGGTG ATCAACATGT AGGACTTCAA GCAAGACTTA TGTCTCAAGC 4 00 

TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC ACAGTAATTT 450 
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TYATCAACCA 
GAAACCACAA 
AGATGTTAGA 
ACCGCGTTAA 
CAAGCTGAAT 
ATTGATCGAT 
GGTTTTCTTA 
GCTTTCTTAA 



AATTCGTATG 
CAGGTGGAAA 
AAAGTAGCAA 
AGTAAAAGTA 
TTGATGTGAT 
TATGGTGTAA 
TAAAGATAAA 
AAGAAAAC 



AAAATCGGTG 
TGCATTAAAA 
CCTTAAAACA 
GTTAAAAATA 
GTTTGGAGAG 
AACTTGATAT 
AAACTTGGAC 



CTATGGGTTA 
TTTTATGCTT 
AAACGAAGAA 
AAGTTGCTCC 
GGTTTAAGCC 
CGTAGATAAA 
AAGGTAGAGA 



TGGTACTCCT 500 

CTGTGCGTTT 550 

CCTATAGGAA 600 

TCCATTCAGA 650 

GTGAAGGTGA 700 

AGTGGTGCGT 750 

AAATTCAAAA 800 

818 



2) INFOEIMATION FOR SEQ ID NO: 1715 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



25 



30 



35 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ablotrophia adiacens 

(B) STRAIN: ATCC 49175 

(C) ACCESSION NUMBER: AF124224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1715 



45 



TGGTGCTATC 
GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ACACTCCAGT 
TCATACRAAG 
TCCAACTCCA 
ACGTGTTCTC 
CGTGGACAAG 
AGAAACTTCA 
TAGACTACGC 
ACACGTGACA 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAGTAGCGA 



TTAGTAGTAT 
CTTATTATCA 
AAGTTGACAT 
GTTCGTGACT 
TGTTGCAGGT 
AAAAAATCTT 
GAACGYGACG 
AATCACAGGT 
TTCGTGTTGG 
AAAAC7\ACTG 
TGAAGCAGGG 
ACATCGAACG 
ACTAAATTCA 
TCATACTCCA 
CAGACATCAC 
CCTGGTGATA 



CTGCAGCTGA 
CGTCAAGTAG 
GGTTGACGAT 
TATTATCAGA 
TCTGCTTTAC 
AGAATTAATG 
TTGACATU^CC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GATAACATTG 
TGGACAAGTT 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTTGT 
ACGTAACTAT 



TGGTCCAATG 
GTGTTCCTTA 
GAAGAATTAT 
ATACGATTTC 
GCGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTACATTATT 
CTTGCTAAAC 
TTACGTATTA 
ACTACCGTCC 
GTGTTACCAG 
GGAAGTTGAA 



CCTCAAACAC 


50 


CATCGTTGTA 


100 


TAGAATTAGT 


150 


CCAGGCGATG 


200 


AGGCGACGCT 


250 


ACGAATACAT 


300 


CCAGTTGAAG 


350 


TCGTGTTGAA 


400 


GTATTTCAGA 


450 


CGTAAATTGT 


500 


ACGTGGTGTT 


550 


CAGGAACAAT 


600 


ACTAAAGAAG 


650 


TCAATTCTAC 


700 


AAGGCGTTGA 


750 


TTAATTCACC 


800 




809 



50 



2) INFORMATION FOR SEQ ID NO: 1716 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

890 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia defectiva 

(B) STRAIN: ATCC 49176 

(C) ACCESSION NUMBER: AF124225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1716 



15 



20 



25 



CGGCGCGATC 
GTGAACACAT 
TTCTTGAACA 
TGAAATGGAA 
ACACTCCAGT 
AACTACGAAG 
TCCAGAACCA 
ACGTATTCTC 
CGTGGTCAAG 
AGAAACTTCT 
TGGATTACGC 
ACTCGTGACC 
CACTCCGYAC 
AAGGTGGTCG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCGAT 



CTCGTTGTAT 
CCTCTTGTCT 
AAGTTGACAT 
GTTCGTGACC 
TATCGCTGGT 
CTAAAGTTTT 
GAACGTGACA 
TATCACTGGT 
TTCGCGTTGG 
AAGACTACCG 
TGAAGCTGGG 
AAATCCAACG 
ACTAAGTTCG 
TCACACTCCA 
CTGACGTAAC 
CCAGGCGACA 
CGAAGAA 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAC 
TCTTGTCTGA 
TCAGCTTTGA 
AGAATTGATG 
CTGACAAGCC 
CGTGGTACTG 
TGACGAAGTT 
TTACCGGTGT 
GACAACGTTG 
TGGTCAAGTA 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTGTT 
ACGTACAAAT 



CGGCCCAATG 
GTGTTCCTTA 
GAAGAATTGC 
ATACGACTTC 
AAGCTTTAGA 
GAACAAGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTTG 
TGAAATGTTC 
GTACCTTGTT 
TTATCTAAAC 
GTACGTATTG 
ACTACCGTCC 
ACTTTACCAG 
GGTTGTTGAA 



30 



CCACAAACTC 50 

CATCGTAGTA 100 

TCGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACGCT 250 

ATGCTTACAT 300 

CCAGTCGAAG 350 

TCGTGTTGAA 4 00 

GTATCGAAGA 450 

CGTAAGTTAT 500 

ACGTGGTGTA 550 

CAGGTTCAAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 7 50 

TTGATCCACC 800 

817 



2) INFORMATION FOR SEQ ID NO: 1717 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS:. Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium a ceo lens 
45 (B) STRAIN: ATCC 49725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1717 

CGGCGCTATC CTGGTTGTTG CTGCAACCGA TGGCCCGATG CCGCAGACCC 50 

50 GCGAGCACGT TCTGCTTGCT CGCCAGGTTG GCGTTCCTTA CATCCTCGTT 100 

GCACTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

GGAGATGGAG ATCTCCGAGC TGCTCGCAGA GCAGGACTAC GATGAGGAAG 200 

CTCCTATCGT TCACATCTCC GCTCTGAAGG CACTCGAGGG TGACGAGAAG 250 

891 
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TGGGTACAGT 
TGATCCGGAG 
TCTTCACCAT 
GGTCGTCTGA 
GTCCCAGAAC 
ACTACACCGA 
CGTGAGGACG 
CCCTCACACC 
GCGGCCGCCA 
CGCACCACCG 
GGTTATGCCT 
TTGCTATGGA 



CCATCGTTGA 
CGCGCTACCG 
TACCGGCCGC 
ACGTCAACGA 
ACCACCGTTA 
GGCTGGCGAC 
TTGAGCGTGG 
AAGTTCGAGG 
CACCCCGYTC 
ACGTTACCGG 
GGCGACAACG 
CGAG 



CCTGATGGAT 
ATCAGCCGTT 
GGTACCGTTG 
GGACGTTGAG 
CCGGTATCGA 
AACTGTGGTC 
CCAGGTTGTT 
GTTCCGTCTA 
ATGAACAACT 
TGTTGTGAAC 
TTGAGATGTC 



GCCTGCGACA 
CTTGATGCCT 
TTACCGGCCG 
ATCATCGGTA 
GATGTTCCGC 
TGCTTCTGCG 
ATCAAGCCGG 
CGTCCTGAAG 
ACCGTCCTCA 
CTGCCTGAGG 
TGTTGAGCTC 



ACTCCATCCC 300 

ATCGAGGACA 350 

TGTTGAGCGT 400 

TCCAGGAGAA 4 50 

AAGATGATGG 500 

TGGTACCAAG 550 

GCGCTTACAC 600 

AAGGAAGAGG 650 

GTTCTACTTC 700 

GCACCGAGAT 750 

ATCCAGCCTG 800 

814 



15 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1718 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynehacterium genltalium 

(B) STRAIN: ATCC 33031 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1718 



CGGCGCCATC 
GTGAGCACGT 
GCACTGAACA 

35 CGAGATGGAG 
CACCTGTTGT 
TGGGCTAAGC 
GGATCCGGAG 
TCTTCACCAT 

4 0 GGCGTCCTGA 
GTCCACCAAG 
ACACCGCAGA 
CGCGAAGATG 
CCCGCACACC 

45 GTGGCCGCCA 
CGCACCACCG 
GGTTATGCCG 
TTGCTATGGA 



CTGGTTGTTG 
TCTGCTGGCT 
AGTGCGACAT 
GTCCGCGAGC 
TCACATCTCC 
AGATCCTGGA 
CGCGAGACCG 
TACCGGCCGC 
ACCTGAACGA 
ACCACCGTTA 
GGCTGGCGAC 
TTGAGCGTGG 
GAGTTCGAGG 
CACCCCGTTC 
ACGTTACCGG 
GGCGACAACG 
CGAG 



CTGCAACCGA 
CGCCAGGTTG 
GGTTGATGAT 
TGCTGGCTGA 
GCACTGAAGG 
GCTCATGGAG 
ACAAGCCGTT 
GGTACCGTTG 
CGAGGTCGAG 
CCTCCATCGA 
AACGCCGCAC 
TCAGATCGTT 
GCTCCGTCTA 
TTCGACAACT 
TGTTGTGAAG 
TTGACATGTC 



TGGCCCGATG 
GCGTTCCGTA 
GAGGAGCTGC 
GCAGGACTTC 
CCCTGGAGGG 
GCTTGCGACA 
CCTGATGCCG 
TTACCGGCCG 
ATCCTGGGCA 
GATGTTCAAC 
TGCTGCTGCG 
GCTAAGCCGG 
CGTTCTGTCC 
ACCGTCCGCA 
CTGCCGGAGG 
CGTCACCCTG 



CCGCAGACCC 50 

CATCCTAGTT 100 

TGGAGCTCGT 150 

GACGAGGAAG 200 

CGACGAGAAG 250 

ACTCCATCCC 300 

GTTGRGGACA 350 

TGTTGAGCGT 4 00 

TCCGCGAGAA 450 

AAGCTGCTGG 500 

TGGCCTGAAG 550 

GCGAGTACAC 600 

AAGGACGAGG 650 

GTTCTATTTC 700 

GCACCGAGAT 750 

ATCCAGCCGG 800 

814 



50 



2) INFORMATION FOR SEQ ID NO: 1719 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Corynebacterium jeikeium 

(B) STRAIN: ATCC 43216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1719 



15 CGGCGCCATC 
GCGAGCACGT 
GCACTGAACA 
CGAGATGGAG 
CTCCGGTTGT 

20 TGGGCTAACC 
GGATCCGGAG 
TCTTCACCAT 
GGCATCCTGA 
GTCCCAGAAG 

25 ACACCGCAGA 
CGCGAGGACG 
CCCGCACACC 
GCGGCCGCCA 
CGCACCACCG 

30 GGTTATGCCG 
TTGCTATGGA 



CTGGTTGTTG 
TCTGCTGGCY 
AGTGTGACAT 
GTCCGCGAGC 
TCACATCTCC 
AGATTCTCGA 
CGCGAGACCG 
TACCGGTCGC 
ACCTGAACGA 
ACCACCGTTA 
GGCTGGCRAC 
TTGAGCGTGG 
GAGTTCGAGG 
CACCCCGTTC 
ACGTTACCGG 
GGCGACAACG 
CGAG 



CCGCAACCGA 
CGCCAGGTTG 
GGTTGACGAT 
TGCTGGCTGA 
GCACTGAAGG 
GCTGATGCAG 
ACAAGCCGTT 
GGTACCGTTG 
CGAGGTTGAG 
CCTCCATCGA 
AACGCTGCAC 
CCAGATCATC 
GCTCCGTCTA 
TTCGACAACT 
TGTTGTGAAG 
TYGACATGTC 



TGGCCCGATG 
GCGTTCCGTA 
GAGGAGCTGC 
GCAGGACTTC 
CCCTGGAGGG 
GCTTGCGACG 
CCTGATGCCG 
TTACCGGCCG 
ATCCTGGGTA 
GATGTTCAAC 
TGCTGCTGCG 
GCTAAGCCGG 
CGTTCTGTCC 
ACCGTCCGCA 
CTGCCTGAGG 
CGTCACCCTG 



CCGCAGACCC 50 

CATCCTGGTT 100 

TGGAGCTCGT 150 

GACGAGGAAG 200 

CGACGAGAAG 250 

AGTCTATCCC 300 

GTTGWGGACA 350 

TGTTGAGCGT 400 

TCCGCGAGAA 450 

T^GCTGCTGG 500 

TGGTCTGAAG 550 

GCGAGTACAC 600 

AAGGACGAGG 650 

GTTCTACTTC 700 

GCACCGAGAT 7 50 

ATCCAGCCGG 800 

814 



35 2) INFORMATION FOR SEQ ID NO: 1720 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coryneibacteriujn pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1720 

CGGCGCTATC TTGGTTGTTG CAGCTACCGA CGGCCCAATG CCACAGACTC 50 
GCGAGCACGT TCTGCTGGCT CGCCAGGTTG GCGTTCCTTA CATCCTGGTT 100 
GCACTAAACA AGTGCGACAT GGTTGACGAC GAGGAAATCC TCGAGCTCGT 150 

893 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CA00/0H50 



10 



CGAGATGGAG 
CTCCAATCGT 
TGGGTTAACG 
TGATCCAGAC 
TCTTCACCAT 
GGTTCCCTGA 
GTCCCAGAAG 
ACTACACCGA 
CGTGAAGACG 
CACCCACAAG 
GCGGCCGCCA 
CGCACCACCG 



ATCCGCGAAT 
TCACATCTCC 
CCATCGTTGA 
CGTGCTACCG 
TACCGGTCGT 
AGGTCAACGA 
ACCACCATCA 
GGCCGGCGAC 
TTGAGCGTGG 
AAGTTCGAAG 
CACCCCGTTC 
ACGTTACCGG 



TGCTGGCTGA 
GCAGTCGGCG 
ACTGATGGAT 
ACAAGCCATT 
GGCACCGTTG 
AGAAGTCGAG 
CCGGTATCGA 
AACGCTGGTC 
ACAGGTTATC 
GTTCCGTCTA 
TTCGACAACT 
TGTTGTTACC 



CCAGGAATTC 
CCTTGGAAGG 
GCTTGTGACG 
CCTGATGCCT 
TTACGGGTCG 
ATCATCGGCA 
AATGTTCCGC 
TGCTGCTTCG 
GTTGCTCCAG 
CGTTCTTTCC 
ACCGTCCTCA 
CTGCCTGAGG 



GACGAAGAAG 200 

CGAAGAGAGG 250 

AGTCGATCCC 300 

ATCGAGGACA 350 

TGTTGAGCGT 400 

TCAAGGAAAA 4 50 

AAGATGCTGG 500 

CGGTACCAAG 550 

GTGCTTACAG 600 

AAGGACGAGG 650 

GTTCTACTTC 700 

GCACCGAG 748 



15 



20 



2) INFORMATION FOR SEQ ID NO: 1721 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium striatum 

(B) STRAIN: ATCC 6940 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1721 



GGCGCTATCT 
CGAGCACGTT 
CACTGAACAA 

35 GAGATGGAGA 
TCCGATCGTT 
GGGTACAGGC 
GATCCGGAGC 
CTTCACCATC 

40 GCTCCCTGAA 
TCCATCTCCA 
CTACACCGAG 
GTGAAGAGGT 
CCTCACACCC 

45 CGGCCGCCAC 
GCACCACCGA 
GTTATGCCTG 
CGCTATGGAC 



TGGTTGTTGC 
CTTCTGGCTC 
GTGCGACATG 
TCCGCGAACT 
CACATCTCTG 
TATCGTTGAC 
GCGAGCTGGA 
ACCGGCCGCG 
CGTCAACGAG 
CCACCGTTAC 
GCTGGCGACA 
TGAGCGCGGC 
AGTTCGAGGG 
ACCCCGTTCA 
CGTTACCGGC 
GCGACAACGT 
GAG 



TGCAACCGAT 
GCCAGGTTGG 
GTTGACGACG 
GCTCGCAGAG 
CTCTGAAGGC 
CTGATGCAGG 
CAAGCCGTTC 
GTACCGTTGT 
GACGTTGAGA 
CGGTATCGAG 
ACTGTGGTCT 
CAGGTTGTTA 
TTCCGTCTAC 
TGGACAACTA 
GTCATCAAGC 
CGAGATGTCY 



GGCCCGRTGC 
CGTTCCTTAC 
AGGAAATTAT 
CAGGACTACG 
TCTTGAGGGC 
CTTGCGATGA 
CTGATGCCAA 
TACTGGCCGT 
TCATCGGTAT 
ATGYTCCGCA 
GCTTCTGCGT 
TTAAGCCGGG 
GTCCTGAAGA 
CCGTCCGCAG 
TGCCTGAGGG 
GTCGAGCTGA 



CGCAGACCCG 50 

ATCCTCGTTG 100 

CGAGCTCGTC 150 

ATGAGGAAGC 200 

GRCGAGAAGT 250 

CTCCATCCCG 300 

TCGAGGACAT 350 

GTTGAGCGTG 4 00 

CCAGGACARG 450 

AGATGATGGA 500 

GGTACCAAGC 550 

CGCTTACACC 600 

AGGAAGAGGG 650 

TTCTACTTCC 700 

CACCGAGATG 750 

TCCAGCCGGT 800 

813 



50 



2) INFORMATION FOR SEQ ID NO: 1722 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(C) ACCESSION NUMBER: AF124220 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1722 
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CGGAGCTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ACACTCCAGT 
TCATACGAAG 
CCCAACACCA 
ACGTATTCTC 
CGTGGACAAG 
CGAAACTGCT 
TAGACTACGC 
GCACGTGAAG 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTWATG 
CAATYGCGGT 



TTAGTAGTAT 
CTTGTTATCT 
AAATGGATAT 
GTTCGTGACT 
TATCGCAGGT 
AAAAAATCTT 
GTTCGTGATA 
AATCACTGGT 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGT 
ATATCCAACG 
ACAAAATTCT 
TCATACTCCA 
CTGACGTTUIC 
CCTGGGGATA 
AGAAGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCGTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
CTGCAGAAGT 
TTCTTCACTA 
TGGTGTAGTT 
ACGTAACTAT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGACTTC 
AAGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCTTTGTT 
TTGGCTAAAC 
TTATGTTCTA 
ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 



CCTCAAACTC 50 

CATCGTTGTA 100 

TTGAATTAGT 150 

CCAGGCGACG 200 

AGGCGACGCT 250 

ACGAATATAT 300 

CCAGTCGAAG 350 

TCGTGTTGAA 400 

GTATCGCTGA 450 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGCTTCAAT 600 

ACTAAAGAAG 650 

TCAGTTCTAC 700 

AAGGTACTGA 750 

TTGATYCACC 800 

817 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1723 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 

(B) STRAIN: ATCC 14018 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1723 

TGGCGCAATC CTCGTGGTTG CTGCTACCGA CGGTCCAATG GCTCAGACCC 50 
GTGAACACGT CTTGCTTGCT AAGCAGGTCG GCGTTCCAAA AATTCTTGTT 100 

895 
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GCTTTGAACA 
TGAAGAAGAG 
GCCCAGTCYT 
GACCACGACA 
CGAGTACATC 
CAATCGAAGA 
CGTGTTGAGC 
TTTGCGCGAT 
AGCAGATGGA 
GGTATCAACC 
TTCTGTGACT 
AGGACGAAGG 
TTCTACTTCC 
CATCGAAATG 
TCCAGGCTAT 



AGTGCGATAT 
GTCCGTGACC 
CCGTACTTCC 
AGTGGGTAGA 
CCAACCCCAA 
TGTGTTCACC 
GTGGTAAGCT 
ACCCAGACCA 
TGAGGCAGAG 
GTACCGACGT 
CCACACACCA 
TGGCCGTCAC 
GTACCACCGA 
GTTCAGCCAG 
CGCAATGGAA 



GGTTGACGAC 
TCCTCGAAGA 
GCTTACGGCG 
GACCGTCAAG 
CTCACGATCT 
ATCTCCGGTC 
CCCAATCAAC 
CCACCGTCAC 
GCTGGCGATA 
TGAGCGTGGT 
AGTTCGAAGG 
TCGCCATTCT 
TGTTACTGGC 
GCGATCACGC 
GAG 



GAAGAGCTTA 
AAACGGCTTC 
CTTTGCATGA 
GAACTCATGA 
TGACAAGCCA 
GTGGTYCCGT 
ACCCCAGTTG 
CTCTATCGAG 
ACACTGGTCT 
CAGGTTGTGG 
CGAAGTTTAC 
TCTCCAACTA 
GTTATCACCT 
AACCTTCACT 



TCGATCTCGT 150 

GATCGCGATT 200 

TGACGCTCCA 250 

AGGCTGTTGA 300 

TTCTTGATGC 350 

TGTCACCGGT 400 

AGATCGTTGG 450 

ACCTTCCACA 500 

TCTTCTCCGC 550 

CTGCTCCAGG 600 

GTCTTGACCA 650 

CCGTCCACAG 700 

TGCCAGACGG 750 

GTTGAGTTGA 800 

823 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1724 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Listeria innocua 

(B) STRAIN: ATCC 33090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1724 



35 CGGAGCTATC 
GTGAACATAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCTGT 

40 GACTGGGAAG 
TCCAACTCCA 
ATGTATTCTC 
CGTGGACAAG 
AGAAAGCAAA 

45 TAGACTACGC 
GCTCGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 

50 AATGGTAATG 
CAATCGCTAT 



TTAGTAGTAT 
CTTACTTTCA 
AATGTGACAT 
ATTCGTGATC 
AATCAAAGGT 
CTAAAATTGA 
GAACGTGATA 
T^TCACTGGT 
TTAAAGTTGG 
AAAGTAGTAG 
TGAAGCTGGC 
ATATCCAACG 
ACTAACTTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGAC 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACAG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 



TGGCCCAATG 
GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 



CCACAAACTC 50 

CATCGTTGTA 100 

TAGAATTAGT 150 

CCTGGCGATG 200 

AGGTGAAGCT 250 

ATTCTTACAT 300 

CCAGTTGAGG 350 

ACGTGTTGAA 4 00 

GTATTGAAGA 4 50 

CGTAAATTAC 500 

ACGTGGTGTT 550 

CAGGTTCGAT 600 

ACTAAAGAAG 650 

ACAATTCTAT 700 

AAGGTACTGA 750 

CTAATTGCAC 800 

817 



896 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1725 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

. (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1725 
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CGGAGCTATC 
GTGAACATAT 
ATTCATGAAC 
TTGAAATGGA 
GACATTCCTG 
TGATTGGGAA 
TTCCAACTCC 
GATGTATTCT 
ACGTGGACAA 
AAGAAAGCAA 
CTAGACTACG 
TGCTCGTGAA 
TTACTCCACA 
GAAGGTGGAC 
TTTCCGTACT 
AAATGGTAAT 
CCAATCGCTA 



TTAGTAGTAT 
TCTTACTTTC 
AAATGTGACA 
AATTCGTGAT 
TAATCAAAGG 
GCTAAAATTG 
AGAACGTGAT 
CAATCACTGG 
GTTAAAGTTG 
AAAAGTAGTA 
CTGAAGCTGG 
GATATCCAAC 
TACTAACTTC 
GTCATACTCC 
ACTGACGTAA 
GCCTGGTGAT 
TCGAAGAC 



CTGCTGCTGA 
ACGTCAAGTT 
TGGTTGACGA 
CTATTAACTG 
TTCAGCTCTT 
ACGAGTTAAT 
ACTGACAAAC 
TCGTGGAACA 
GTGACGAAGT 
GTAACTGGAG 
CGACAACATT 
GTGGTCAAGT 
AAAGCTGAAA 
ATTCTTCAAC 
CTGGTATTGT 
AACATTGAGC 



TGGTCCAATG 
GGTGTTCCAT 
TGAAGAATTA 
AATATGAATT 
AAAGCACTTC 
GGAAGCTGTA 
CATTCATGAT 
GTTGCAACTG 
AGAAGTTATC 
TAGAAATGTT 
GGCGCACTTC 
ATTAGCTAAA 
CTTATGTTTT 
AACTACCGCC 
TACACTTCCA 
TTGCAGTTGA 



CCACAAACTC 50 

ACATCGTTGT 100 

CTTGAATTAG 150 

CCCTGGCGAC 200 

AAGGTGAAGC 250 

GATTCTTACA 300 

GCCAGTTGAG 350 

GACGTGTTGA 400 

GGTATTGAAG 4 50 

CCGTAAATTA 500 

TACGTGGTGT 550 

CCAGGTTCGA 600 

AACTAAAGAA 650 

CACAATTCTA 700 

GAAGGTACTG 750 

ACTAATTGCA 800 

818 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1726 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: LSPQ 5093202 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1726 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG 

897 



CCACAAACTC 
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SUBSTITUTE SHEET (RULE 26) 
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GTGAACATAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCTGT 
GACTGGGAAG 
TCCAACTCCW 
ATGTATTCTC 
CGTGGACAAG 
AGAAAGCAAA 
TAGACTACGC 
GCTCGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAAYG 
CAATCGCTAT 



CTTACTTTCA 
AATGTGACAT 
ATTCGTGATC 
AATCAAAGGT 
CTAAAATTGA 
GAACGTGATA 
AATCACTGGT 
TTAAAGTTGG 
AAAGTAGTAG 
TGAAGCTGGC 
ATATCCAACR 
ACTAACTTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGAC 



CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACAG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 



GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 



CATCGTTGTA 100 

TAGAATTAGT 150 

CCTGGCGATG 200 

AGGTGAAGCT 250 

ATTCTTACAT 300 

CCAGTTGAGG 350 

ACGTGTTGAA 400 

GTATCGAAGA 450 

CGTAAATTAC 500 

ACGTGGTGTT 550 

CAGGTTCGAT 600 

ACTAAAGAAG 650 

ACAATTCTAT 700 

AAGGTACTGA 750 

CTAATTGCAC 800 
817 



20 2) INFORMATION FOR SEQ ID NO: 1727 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria seeligeri 

(B) STRAIN: ATCC 35967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1727 



40 



45 
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CGGAGCTATC 
GTGAACATAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCTGT 
GACTGGGAAG 
TCCAACTCCA 
ATGTATTCTC 
CGTGGACAAG 
AGA/yiGCAAA 
TAGACTACGC 
GCTCGTGAAG 
TACTCCACAT 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAATG 
CAATCGCTAT 



TTAGTAGTAT 
CTTACTTTCA 
AATGTGACAT 
ATTCGTGATC 
AATCAAAGGT 
CTAAAATTGA 
GAACGTGATA 
AATCACTGGT 
TTAAAGTTGG 
AAAGTAATAG 
TGAAGCTGGC 
ATATCCAACG 
ACTAACTTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGAC 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACTG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 



TGGCCCAATG 
GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 



CCACAAACTC 50 

CATCGTTGTA 100 

TTGAATTAGT 150 

CCTGGTGATG 200 

AGGTGAAGCT 250 

ATTCTTACAT 300 

CCAGTTGAGG 350 

ACGTGTTGAA 400 

GTATTGAAGA 450 

CGTAAATTAC 500 

ACGTGGTGTT 550 

CAGGTTCGAT 600 

ACTAAAGAAG 650 

ACAATTCTAT 700 

AAGGTACTGA 750 

CTAATTGCAC 800 

817 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 25923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1728 



20 



25 
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CGGTGGTATC 
GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ATGTACCTGT 
CAATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
CACATCTAAA 
ACTACGCTGA 
CGTGAAGACG 
ACCACATACT 
GTGGACGTCA 
CGTACTACTG 
GGTAATGCCT 
TCGCGATTGA 



TTAGTAGTAT 
TCTTTTATCA 
AAGTTGACAT 
GTTCGTGACT 
AATCGCTGGT 
AAAAAATCTT 
GAACGTGATT 
AATCACTGGT 
TCAAAGTTGG 
ACAACTGTTA 
AGCTGGTGAC 
TACAACGTGG 
GAATTCAAAG 
CACTCCATTC 
ACGTAACTGG 
GGTGATAACG 
AGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAAGCGA 
TCAGCATTAR 
AGAATTARTG 
CTGACAAACC 
CGTGGTACTG 
TGAAGAAGTT 
CAGGTGTTGA 
AACATTGGTG 
TCAAGTATTA 
CAGAAGTATA 
TTCTCAAACT 
TGTTGTTCAC 
TTGAAATGAC 



CGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATATGACTTC 
AAGCTTTAGA 
GAAGCTGTAG 
ATTCATGATG 
TTGCTACAGG 
GAAATCATCG 
AATGTTCCGT 
CATTATTACG 
GCTGCTCCTG 
CGTATTATCA 
ATCGTCCACA 
TTACCAGAAG 
AGTAGAATTA 



CCACAAACTC 50 

ATTAGTAGTA 100 

TAGAATTAGT 150 

CCAGGTGACG 200 

AGGCGATGCT 250 

ATACTTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 400 

GTTTACATGA 4 50 

AAATTATTAG 500 

TGGTGTTGCT 550 

GTTCAATTAC 600 

AAAGACGAAG 650 

ATTCTATTTC 700 

GTACTGAAAT 7 50 

ATCGCTCCAA BOQ 

814 
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2) INFORMATION FOR SEQ ID NO: 1729 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1729 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 

899 
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GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ATGTACCTGT 
GACTATGAGC 
TCCAACACCA 
ACGTATTCTC 
CGTGGTCAAA 
AGAATCAAGC 
TAGACTACGC 
TCACGTGATG 
CACACCACAT 
AAGGTGGTCG 
TTCCGTACTA 
AATGGTTATG 
CAATCGCTAT 



TCTTTTATCA 
AAGTTGACAT 
GTTCGTGRCT 
AATCTCTGGT 
AAAAAATCTT 
GAACGTGATT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACTG 
TGAAGCTGGT 
ATGTACAACG 
ACAAAATTCA 
TCATACGCCA 
CTGACGTAAC 
CCTGGCGATA 
TGAAGAC 



CGTRACGTTG 
GGTTGACGAY 
TATTAAGCGA 
TCTGCATTAA 
AGACTTAATG 
CTGACAAACC 
CGTGGTACTG 
TGAAGAAATC 
TTACTGGTGT 
GACAACATTG 
TGGTCAAGTT 
AAGCGGATGT 
TTCTTCACTA 
TGGTGTTGTT 
ACGTTGAAAT 



GTGYTCCAGC 


ATTAGTTGTA 


100 


GAAGAATTAT 


TAGAATTRGT 


150 


ATATGACTTC 


CCAGGTGACG 


200 


AAGCTTTAGA 


AGGCGACGCT 


250 


CAAGCTGTTG 


ATGACTYCAT 


300 


ATTCATGATG 


CCAGTTGAGG 


350 


TTGCTACAGG 


CCGTGTTGAA 


400 


GARATCATCG 


GTATGCAAGA 


450 


AGAAATGTTC 


CGTAAATTAT 


500 


GTGCATTATT 


ACGTGGTGTT 


550 


TTAGCTGCTC 


CTGGTACTAT 


600 


TTACGTTTTA 


TCTAAAGATG 


650 


ACTACCGCCC 


ACAATTCTAT 


700 


AACTTACCAG 


AAGGTACTGA 


750 


GGATGTTG7U\ 


TTAATTTCTC 


800 






817 



20 2) INFORMATION FOR SEQ ID NO: 1730 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1730 
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CGGCGGTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ATGTACCAGT 
GAATACGAAC 
CCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
AGAAAGCAAG 
TAGACTACGC 
GCACGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTTATG 
CAATCGCGAT 



TTAGTAGTAT 
CTTATTATCA 
AAGCTGACAT 
GTTCGTGACT 
TATCGTTGGT 
AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACAG 
TGAAGCTGGT 
ACGTACAACG 
ACAAAATTCA 
TCATACTCCA 
CTGACGTAAC 
CCTGGCGATA 
TGAAGAC 



CTGCTGCAGA 
CGTAACGTTG 
GGTTGACGAC 
TATTATCTGA 
TCTGCATTAA 
AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGATGT 
TTCTTCACTA 
TGGCGTTGTT 
ACGTAG/^AAT 



TGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCTGTAG 
ATTCATGATG 
TAGCAACAGG 
GAAATCATCG 
AGAAATGTTC 
GTGCTTTATT 
TTAGCAGCTC 
TTACGTTTTA 
ACTACCGCCC 
CACTTACCAG 
GACTGTTGAA 



CCACAAACTC 50 

TTTAGTTGTA 100 

TAGAATTAGT 150 

CCTGGTGACG 200 

AGGCGACCCA 250 

ATGACTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 4 00 

GTATCACTGA 450 

CGTAAATTAT 500 

ACGTGGTGTT 550 

CTGGCTCTAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 750 

TTGATCGCTC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1731 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactlae 

(B) STRAIN: ATCC 27591 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1731 



25 
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CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ACCTTCCAGT 
AAATACGAAG 
TCCAGAACCA 
ATGTATTCTC 
CGTGGTACTG 
AGATATCCAA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTTG 
CCTTCTTTCA 
AAGTTGACCT 
ATTCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGATA 
AATCACTGGA 
TTCGTGTCAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTTA 
TCATACTCCA 
CTGACGTAAC 
CCTGGTGATA 
AGAACAA 



CTTCAACTGA 
CGTCAAGTTG 
TGTTGATGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAGTT 
TTACTGGTGT 
GACAACGTTG 
TGGTCAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTTACTAT 



TGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCACTTGA 
AGCACTGTTG 
TTTACTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
CTTGCTAAAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGAAGTTGAA 



35 



CCACAAACTC 


50 


CCTTATCGTA 


100 


TTGAATTGGT 


150 


CCAGGTGATG 


200 


AGGCGACGAA 


250 


ATGAGTACAT 


300 


CCAGTTGAAG 


350 


ACGTATCGAC 


400 


GTATTAAAGA 


450 


CGTAAACAAC 


.500 


TCGTGGTGTT 


550 


CAGGTTCAAT 


600 


TCTAAAGAAG 


650 


ACAATTCTAC 


700 


CAGGTU^CAGA 


750 


TTGATTCACC 


800 




817 



2) INFORMATION FOR SEQ ID NO: 1732 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: ATCC 27336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1732 
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CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ATCTTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGTGGTATCG 
AGAAACTCRA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCGTTAT 
GAACGTGACA 
AATCACTGGA 
TTAAAGTCAA 
AAAGCAGTTG 
TCTTGCTGGA 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTTAC 
CCTGGTGATA 
AGAACAA 



CTTCAACTGA 
CGTCAGGTTG 
GGTTGACGAC 
TATTGTCAGA 
TCAGCACTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAATC 
TTACTGGTGT 
GATAACGTAG 
TGGACAAGTT 
AAGGTG7y\GT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACAAT 



CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCTCTTGA 
AACACAGTTG 
ATTGCTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTCCTTCT 
ATCGCTAAAC 
CTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGACGTTGAG 



CCACAAACTC 50 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGACG 200 

AGGTGACTCT 250 

ATGAGTATAT 300 

CCAGTCGAGG 350 

ACGTATCGAC 400 

GTATCAAAGA 450 

CGTAAACAAC 500 

TCGTGGTGTT 550 

CAGGTTCAAT 600 

ACTAAAGAAG 650 

ACAATTCTAC 700 

CAGGTACTGA 7 50 

TTGATTCACC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1733 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1733 



CGGTGCGATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGTGGTGTTG 
AGACATCCAA 
TTGACGRAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGACA 
AATCACTGGT 
TTCGTGTCAA 
AAAGCAGTTG 
TATTGCCGGA 
TW^TCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
TGAACAA 



CATCTACTGA 
CGTCAGGTTG 
GGTTGACGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGACTTGATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACTGGTGT 
GATAACGTCG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACTAT 



CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGATTTC 
AAGCTCTTGA 
AACACTGTTG 
ATTGTTGCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
TTGGCTGCAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCTG 
CGACGTTGAG 



CCACAAACTC 50 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGTGATTCT 250 

ACGAATACAT 300 

CCAGTCGAAG 350 

ACGTATCGAC 400 

GTCTTAAAGA 4 50 

CGTAAACAAC 500 

TCGTGGTATC 550 

CTGGTTCAAT 600 

TCTAAAGAAG 650 

ACAGTTCTAC 700 

CAGGTACTGA 750 

TTGATCCACC 800 
817 
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2) INFORMATION FOR SEQ ID NO: 1734 
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10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Agrobacterium tumefaciens 
(C) ACCESSION NUMBER: x99673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1734 



20 
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AACATGATCA 
GGCTGCCGAC 
GTCAGGTGGG 
GTTGACGACG 
TCTGTCGTCC 
CGGCACTTGC 
ATCCGCGAGC 
TCCGATCGAC 
CGGGTCGTGG 
GTTGGTGAAG 
TGTTACCGGC 
GCGACAACAT 
CGTGGTCAGA 
CATGGCAGAA 
CGTTCTTCAC 
ACCGGTATCG 
CAACGTCACT 
AGCTGCGCTT 



CCGGTGCTGC 
GGCCCGATGC 
CGTTCCGGCC 
CCGAGCTTCT 
TACGACTTCC 
TGCTCTTGAA 
TGATGGCTGC 
CAGCCGTTCC 
TACGGTTGTG 
AAGTCGAAAT 
GTTGAAATGT 
CGGTGCACTC 
TCCTGTGCAA 
GCCTACATCC 
GAACTACCGT 
TTTCGCTTCC 
GTTGAAGTCG 
CGCTATCCGC 



CGAGATGGAC 
CACAGACCCG 
ATCGTCGTGT 
CGAGCTCGTC 
CGGGCGACGA 
GATTCTGACA 
TGTCGACGCC 
TGATGCCGAT 
ACGGGTCGCG 
CGTCGGCATC 
TCCGCAAGCT 
GTTCGCGGCG 
GCCGGGTTCG 
TGACGAAGGA 
CCGCAGTTCT 
TGAAGGCACG 
AGCTGATCGT 
GAAGGCGGCC 



GGCGCGATCC 
CGAGCACATC 
TCCTCAACAA 
GAGCTTGAAG 
TATCCCGATC 
AGAAGATCGG 
TACATCCCGA 
CGAAGACGTG 
TTGAGCGCGG 
CGTCCGACCT 
GCTCGACCAG 
TTACCCGTGA 
GTCAAGCCGC 
AGAAGGCGGC 
ACTTCCGTAC 
GAAATGGTTA 
TCCGATCGCG 
GTACCGTCGG 



TGGTTTGCTC 50 

CTGCTTGCCC 100 

GGTCGACCAG 150 

TTCGCGAACT 200 

ATCAAGGGTT 250 

TGAAGACGCG 300 

CGCCTGAGCG 350 

TTCTCGATCT 400 

TATCGTCAAG 450 

CGAAGACGAC 500 

GGCCAGGCCG 550 

CGGCGTCGAG 600 

ACAAGAAGTT 650 

CGTCATACGC 700 

GACTGACGTT 750 

TGCCTGGCGA 800 

ATGGAAGAAA 850 

CGCCGGC 897 



35 



2) INFORMATION FOR SEQ ID NO: 1735 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus subtilis 
50 (B) STRAIN: 168 

(C) ACCESSION NUMBER: Z99104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1735 

903 
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ATGATCACTG 
TGCTGATGGC 
ACGTTGGTGT 
5 GACGACGAAG 
TAGCGAATAC 
CTCTTAAAGC 
CTTATGGATG 
AAAACCATTC 

10 GTACAGTTGC 
G7U\GTTGAAA 
AGGTGTTGAA 
ACATTGGTGC 
CAAGTACTTG 

15 TGAAGTTTAC 
TCTCTAACTA 
ATCATCCATC 
TGAAATGAAC 
GTTTCTCTAT 

20. 



GTGCTGCGCA 
CCAATGCCAC 
ACCATACATC 
AGCTTCTTGA 
GACTTCCCTG 
TCTTGAAGGA 
CGGTTGATGA 
ATGATGCCAG 
TACTGGCCGT 
TCATCGGTCT 
ATGTTCCGTA 
CCTTCTTCGC 
CTAAACCAGG 
GTTCTTTCTA 
CCGTCCTCAG 
TTCCAGAAGG 
GTTGAACTTA 
TCGTGAAGGC 



AATGGACGGA 
AAACTCGTGA 
GTTGTATTCT 
ACTAGTTGAA 
GTGATGATGT 
GACGCTGAGT 
GTACATCCCA 
TTGAGGACGT 
GTAGAACGCG 
TCAAGAAGAG 
AGCTTCTTGA 
GGTGTATCTC 
TACAATCACT 
AAGAAGAGGG 
TTCTACTTCC 
CGTAGAAATG 
TTTCTACAAT 
GGACGTACTG 



GCTATCCTTG 
GCACATCCTT 
TAAACAAATG 
ATGGAAGTTC 
ACCAGTTGTT 
GGGAAGCTAA 
ACTCCAGAAC 
ATTCTCAATC 
GACAAGTTAA 
AACAAGAAAA 
TTACGCTGAA 
GTGAAGAAAT 
CCACACAGCA 
TGGACGTCAT 
GTACAACTGA 
GTTATGCCTG 
CGCTATCGAA 
TTGGT 



TAGTATCTGC 50 

CTTTCTAAAA 100 

CGACATGGTA 150 

GCGATCTTCT 200 

AAAGGTTCTG 250 

AATCTTCGAA 300 

GCGACACTGA 350 

ACTGGTCGTG 4 00 

AGTCGGTGAC 4 50 

CAACTGTTAC 500 

GCTGGTGACA 550 

CCAACGTGGT 600 

AATTCAAAGC 650 

ACTCCATTCT 700 

CGTAACTGGT 7 50 

GAGATAACAC 800 

GAAGGAACTC 850 
885 
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2) INFORMATION FOR SEQ ID NO: 1736 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: BacteroidBs frstgilis 
35 (B) STRAIN: DSM 2151 

(C) ACCESSION NUMBER: P33165 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1736 



4 0 ATGGTTACTG 
TACTGATGGT 
AGGTAAACGT 
GAAGATGCTG 
TTCATTCTAT 

4 5 CTCTTGGTGC 
CTGATGGAAG 
TAAACCTTTC 
GTACTGTAGC 
GAAATCGAAA 

50 TGTTGAAATG 
TAGGTCTGTT 
GTTCTTTGTA 
GGTTTATATC 



GTGCTGCTCA 
CCGATGCCTC 
TCCGAAGCTG 
AGATGTTGGA 
GATTTCGACG 
ATTGAACGGC 
CTGTTGATAC 
TTGATGCCGG 
TACAGGTCGT 
TCCTCGGTTT 
TTCCGCAAAC 
GCTTCGTGGT 
AACCGGGTCA 
CTGAAGAAAG 



GATGGACGGT 
AGACTCGTGA 
GTTGTATTCA 
ACTTGTTGAA 
GTGACAATAC 
GTAGAAAAAT 
TTGGATTCCA 
TAGAAGACGT 
ATCGAAACTG 
GGGTGAAGAT 
TTCTGGATCA 
GTTGACAAGA 
GATTAAACCT 
AAGAAGGTGG 



GCTATCATTG 
GCACATCCTT 
TGAACAAGTG 
ATGGAAATGA 
TCCGATCATT 
GGGAAGACAA 
CTGCCTCCGC 
GTTCTCTATC 
GTGTTATCCA 
AAGAAATCAG 
GGGTGAAGCT 
ACGAAATCAA 
CACTCTAAAT 
TCGTCACACT 



TAGTTGCTGC 50 

TTGGCTCGTC 100 

CGATATGGTT 150 

GAGAATTGCT 200 

CAGGGTTCTG 250 

AGTAATGGAA 300 

GCGATGTTGA 350 

ACAGGTCGTG 4 00 

TGTAGGTGAT 450 

TTGTAACAGG 500 

GGTGACAACG 550 

ACGTGGTATG 600 

TCAAAGCAGA 650 

CCATTCCATA 700 
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ACAAATATCG TCCTCAGTTC TACCTGCGTA CTATGGACTG TACAGGTGAA 750 

ATCACTCTTC CGGAAGGAAC TGAAATGGTA ATGCCGGGTG ATT^CGTAAC 800 

TATCACTGTA GAGTTGATCT ATCCGGTTGC ACTGAACATC GGTCTTCGTT 850 

TCGCTATCCG CGAAGGTGGA CGTACAGTAG GT 882 



2) INFORMATION FOR SEQ ID NO: 1737 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 
20 (B) STRAIN: U78183 

(C) ACCESSION NUMBER: U7 8183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1737 



25 AATATGATTA 
TGCTGATAGT 
AAAGAATGGG 
GCAGATCCTG 
TGAAAAATAT 

30 TTGGGGCTAT 
GAACTTCTTG 
TGACAAGCCA 
GAGGCACTGT 
CAAGT^GTTG 

35 TGGTGTTGAA 
ATGTTGGTCT 
CAAGTTTTGT 
TTCAATTTAT 
TCCCAGGGTA 

40 GTTGTTGCTT 
TATTATTGTT 
TTGCTGTTCG 



CAGGAGCAGC 
GGTGCTGAGC 
AATAAAGAAA 
AACTTGTTGA 
GGCTTTTCAG 
GTCAAATCCA 
AATCTATGGA 
TTTTTGCTTG 
TGCTACTGGG 
AAATAGTTGG 
ATGTTCCAGA 
TCTTTTGAGA 
CAGCTCCAGG 
TGTTTGACTA 
TAGACCACAG 
TAGAGGGCAA 
GAGCTGATCT 
AGAAGGTGGA 



TCAAATGGAT 
CTCAAACAAA 
ATAATAGTTT 
GCTTGTTGAA 
CTGATACTCC 
GAAGATCCTG 
TAATTATTTT 
CTGTTGAAGA 
CGTATTGAAA 
AATTAAAGAA 
AAATTCTTGA 
GGCGTTGATA 
TACAATTACT 
AAGAAGAAGG 
TTCTTTTTTA 
AGAAATGGTT 
CTTCAATAGC 
AGAACCGTTG 



GCAGCGATAC 
AGAGCATTTG 
TTTTAAATAA 
GTTGAAGTTT 
AATAATCAAA 
AATCTACAAA 
GATCTTCCAG 
TGTATTTTCT 
GAGGTATTAT 
ACCAGAAAAA 
GCAAGGTCAA 
AAAAAGACAT 
CCACACAAGA 
CGGTAGGCAC 
GAACAACCGA 
ATGCCTGGTG 
TATGGATAAG 
CTTCAGGA 



TTTTAGTTGC 50 

CTTCTTGCTC 100 

ATTGGACTTA 150 

TAGAACTTGT 200 

GGTTCAGCTT 250 

ATGCGTTAAA 300 

AAAGAGATAT 350 

ATTTCAGGAA 400 

TAAAGTTGGT 450 

CTACTGTTAC 500 

GCAGGGGATA 550 

TGAGAGGGGG 600 

AATTTAAAGC 650 

AAGCCATTTT 700 

TGTTACTGGA 7 50 

ATAATGTTGA 800 

AATGTAGAAT 850 

888 



45 

2) INFORMATION FOR SEQ ID NO: 1738 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevibacterium linens 

(B) STRAIN: DSM 20425 

(C) ACCESSION NUMBER: X76863 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1738 



10 AACATGATCA 
CGCTACCGAC 
GTCAGGTCGG 
GTCGATGACG 
GCTCTCGAGC 

15 CCGCTCTCAA 
GATCTCATGG 
CGACAAGCCG 
GTGGAACCGT 
GACGAAATCG 

20 CACCGCTATC 
AGAACGTCGG 
GGTCAGGTCA 
GGCTCAGGTC 
TCTACTCGAA 

25 GGTGTCATCA 
CACCGATATG 
TCCGCTTCGC 



CCGGTGCCGC 
GGACCGATGC 
CGTTCCCTAC 
AGGAGCTCCT 
CAGGACTTCG 
GGCGCTGGAA 
CTGCCGTCGA 
TTCCTCATGC 
CGTCACCGGT 
AAATCGTCGG 
GAGATGTTCC 
TCTGCTCCTC 
TCGTGAAGCC 
TACATCCTGA 
CTACCGTCCG 
CGCTGCCCGA 
TCGGTCGAGC 
AATCCGCGAA 



TCAGATGGAC 
CCCAGACCCG 
ATCGTCGTGG 
CGAGCTCGTC 
ACGGAGACAA 
GGCGACGAGA 
TGACAACGTT 
CCGTCGAGGA 
CGTGTCGAGC 
CATCAAGGAG 
GCAAGACCCT 
CGCGGCACCA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGGCACCGAG 
TCATCCAGCC 
GGTGGCCGCA 



GGTGCGATCC 
TGAGCACGTG 
CTCTGAACAA 
GAATTCGAGG 
CGCTCCGGTC 
AGTGGGTCAA 
CCGGAGCCGG 
CGTCTTCACG 
GCGGCGTGCT 
AAGTCGTCCA 
GCCGGATGCC 
AGCGCGAGGA 
ACCCCGCACA 
GGGCGGACGT 
TCCGGACCAC 
ATGGTCATGC 
GATCGCTATG 
CCGTCGGCGC 



TCGTCGTCGC 50 

CTGCTCGCGC 100 

GTCCGACATG 150 

TCCGCGACCT 200 

ATTCCGGTGT 250 

GAGCGTTCAG 300 

AGCGCGATGT 350 

ATCACCGGTC 400 

CCTGCCTAAC 450 

AGACGACTGT 500 

CGTGCAGGTG 550 

TGTTGAGCGC 600 

CCAAGTTCGA 650 

CACAACCCGT 700 

GGACGTCACC 750 

CCGGCGACAA 800 

GAGGACCGCC 850 

CGGT 894 



30 



2) INFORMATION FOR SEQ ID NO: 1739 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 888 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN: F/IC-Cal-13 

(C) ACCESSION NUMBER: L22216 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1739 

ATGATCACGG GCGCAGCGCA GATGGACGGC GCGATCCTGG TTTGCTCGGC 50 

AGCAGACGGC CCGATGCCGC AAACGCGTGA GCACATCCTG CTGGCGCGTC 100 

50 AGGTTGGTGT TCCGTACATC ATCGTGTTCC TGAACAAGTG CGACAGTGTG 150 

GACGACGCTG AACTGCTCGA GCTGGTCGAG ATGGAAGTTC GCGAACTCCT 200 

GTCGAAGTAC GACTTCCCGG GCGACGACAC GCCGATCGTG AAGGGTTCGG 250 

CCAAGCTGGC GCTGGAAGGC GACACGGGCG AGCTGGGCGA AGTGGCGATC 300 

906 
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ATGAGCCTGG 
AGTTGACGGC 
GCCGTGGTAC 
GGCGAAGAAA 
CACGGGCGTT 
ACAACGTCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCAACAA 
GGCTCGATCG 
CGTGTCGATC 
TGCGCTTCGC 



CAGACGCGCT 
GCGTTCCTGA 
GGTGGTGACG 
TCGAAATCGT 
GAAATGTTCC 
TATCCTGCTG 
TGGCGAAGCC 
TACGTGCTGA 
CTACCGTCCG 
AGCTGCCGAA 
ACGGTGAAGC 
AATCCGTGAA 



GGACACGTAC 
TGCCGGTGGA 
GGTCGTGTCG 
CGGTATCAAG 
GCAAGCTGCT 
CGCGGCACGA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGACAAGGAA 
TGATTGCTCC 
GGCGGCCGTA 



ATCCCGACGC 
AGACGTGTTC 
AGCGCGGCAT 
CCGACGGTGA 
GGACCAAGGT 
AGCGTGAAGA 
ACGCCGCACA 
AGGCGGCCGT 
TCCGTACGAC 
ATGGTGATGC 
GATCGCGATG 
CGGTCGGC 



CGGAGCGTGC 350 

TCGATCTCGG 4 00 

CGTGAAGGTC 4 50 

AGACGACCTG 500 

CAGGCAGGCG 550 

CGTGGAGCGT 600 

CGCACTTCAC 650 

CACACGCCGT 700 

GGACGTGACG 750 

CGGGCGACAA 800 

GAAGAAGGTC 850 

888 



15 



20 



2) INFORMATION FOR SEQ ID NO: 1740 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fibrobacter succlnogenes 

(B) STRAIN: S85 

(C) ACCESSION NUMBER: X76866 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1740 



AACATGGTGA 
CGCTACTGAC 

35 ACCAGGTTGG 
GTTGACGATG 
CCTCTCCAAG 
CCGCTCTCAA 
GAACTCATGA 

40 CGAC7UVGCCG 
GCGGCACTGT 
GACAAGGTTG 
CGGTGTTGAA 
ACGTTGGTCT 

45 ATGGTTCTCG 
TGAAATCTAC 
TGAATGGCTA 
ACGATCCAGC 
CACGATCCAC 

50 GCTTCGCTAT 



CTGGTGCTGC 
GGTCCGATGC 
CGTGCCGAAG 
CTGAAATTCT 
TATGACTTCG 
GGCCCTCGAA 
ACGCTTGCGA 
TTCCTCATGC 
CGCTACTGGC 
AACGTATCGG 
ATGTTCCGTA 
CCTCCTCCGT 
CAGCTCCGAA 
GTTCTCACGA 
CCGTCCGCAG 
TCCCGGAAGG 
GTGAACCTCA 
CCGTGAAGGT 



TCAGATGGAC 
CGCAGACTCG 
ATCGTCGTGT 
CGACCTCGTC 
ACGGTGACAA 
GGCGATCCGG 
CGAATACATC 
CGATCGAAGA 
CGTATCGAAC 
TCTCGGTGAA 
AGCTCCTCGA 
GGTGCTGAAA 
GTCTGTCACT 
AGGACGAAGG 
TTCTACTTCC 
TGTCGAAATG 
TCGCTCCGAT 
GGACGTACTG 



GGCGCTATCC 
CGAACACATC 
TCATGAACAA 
GAAATGGAAG 
CACCCCGATC 
AATACCAGGA 
CCGCTCCCGC 
CGTGTTCACG 
GCGGTGTCGT 
ACCACCGAAT 
CGACGCTCAG 
AGAAGGACAT 
CCGCACACCG 
TGGCCGTCAC 
GCACCACCGA 
GTTACTCCGG 
CGCTATGGAA 
TTGGTGCTGG 



TCGTTGTTGC 50 

CTTCTCGCTC 100 

GTGCGACATG 150 

TTCGCGAACT 200 

ATCCGTGGTT 250 

CAAGGTCATG 300 

AGCGCGATAC 350 

ATTACTGGCC 400 

TCGCTTGAAC 4 50 

ACGTCATCAC 500 

GCAGGTGACA 550 

CGTCCGTGGC 600 

AATTTAAGGC 650 

ACGCCGTTCA 700 

CGTTACTGGT 750 

GTGACACGGT 800 

AAGCAGCTCC 850 

C 891 
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2) INFORMATION FOR SEQ ID NO: 1741 

(i) SEQUENCE CHAflACTERISTICS : 

(A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Flavobacterium ferrugineum 

(B) STRAIN: DSM 13524 

(C) ACCESSION NUMBER: X76867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1741 



AACATGATCA 
TGCATCAGAC 

20 CCCAGGTAGG 
GTTGACGACG 
ACTGACTAAA 
CCGCTACAGG 
AACCTGATGG 

25 TGATCTGCCG 
GTGGTACTGT 
GAGCCTGTTG 
TACAGGTGTT 
ATAACGCCGG 

30 GGTATGGTAA 
AGGCGAAGTT 
TCTTCAACAA 
GGTGAAGTAG 
CACCAACCTG 

35 TGAAATTCGC 



CCGGTGCTGC 
GGTCCTATGC 
TGTACCTAAA 
AAGAGCTCCT 
CGCGGTTTCG 
CGCCCTCGCT 
ACGCTGTTGA 
TTCCTGATGA 
TGCTACCGGT 
AGATCGTAGG 
GAGATGTTCC 
TCTCCTCCTC 
TCGTTAAACC 
TACGTACTGA 
ATACCGTCCT 
AACTGAACGC 
ACCGTTAAAC 
GATCCGCGAA 



CCAGATGGAC 
CTCAAACAAA 
ATGGTTGTGT 
GGAGCTGGTT 
ACGGCGACAA 
GGTGAAGAAA 
CAGCTACATC 
GCGTAGAGGA 
CGTATCGAGC 
TCTGCAGGAG 
GCAAACTCCT 
CGTGGTGTTG 
CGGTTCCATC 
GCAAAGACGA 
CAATTCTACT 
AGGAACAGAA 
TGATCCAACC 
GGTGGCCGTA 



GGTGCTATCT 
AGAACACATC 
TTCTGAATAA 
GAGATCGAGG 
CACTCCAATC 
AGTGGGTTAA 
CCACTGCCTC 
CGTATTCTCT 
GTGGCCGTAT 
TCTCCCCTGA 
CGACGAAGGT 
AAAAAACACA 
ACTCCGCACA 
AGGTGGCCGT 
TCCGTACAAC 
ATGGTTATGC 
GATCGCTATG 
CCGTAGGTGC 



TAGTTGTGGC 50 

CTGCTTGCTG 100 

AGTTGACCTC 150 

TTCGCGAAGA 200 

ATCAAAGGTT 250 

AGAAATTGAA 300 

CTCGTCCGGT 350 

ATCACTGGTC 4 00 

CAAAGTTGGT 450 

ACTCTACCGT 500 

GAAGCTGGTG 550 

GATCCGTCGC 600 

CGGACTTCAA 650 

CACACTCCAT 700 

TGACGTTACA 750 

CTGGTGATAA 800 

GAA7\AAGGTC 850 

AGGA 894 



2) INFORMATION FOR SEQ ID NO: 1742 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Helicobacter pylori 

(B) STRAIN: 26695 

(C) ACCESSION NUMBER: AE000626 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1742 



AACATGATCA 
TGCAGCTGAT 
5 GTCAAGTAGG 
GTAGATGACC 
GTTGAGCGCG 
CAGCTTTTJ^G 
GGTGAAAAAG 

10 TCCAGAAAGA 
TCTCTATTGC 
GTGGTGAAAG 
AAAAACGACT 
GTGAAGCCGG 

15 GAAGTGGAAC 
CAAGAAATTT 
GACACACTCC 
ACTGATGTGA 
GCCTGGCGAT 

2 0 TAGAGTTGGG 
GCTGGT 



CCGGTGCGGC 
GGCCCTATGC 
CGTGCCTCAC 
T^AGAATTGTT 
TATGAATTTC 
AGCTTTAGAA 
TGCTTAAACT 
GACACTGAAA 
GGGTAGAGGG 
TAGGCGATGA 
GTAACCGGTG 
CGATAATGTG 
GCGGTATGGT 
GAGGGAGAAA 
ATTCTTCACC 
CTGGCTCTAT 
AATGTGAAAA 
AACTAAATTT 



GCAAATGGAC 
CTCAAACTAG 
ATCGTTGTTT 
AGAACTTGTA 
CTGGCGATGA 
GAAGCAAAGG 
TATGGCTGAA 
AAACTTTCTT 
ACTGTGGTTA 
AGTGGAAATC 
TAGAAATGTT 
GGCGTGCTTT 
TCTATGCAAA 
TTTATGTCCT 
AATTACCGCC 
CACCCTTCCT 
TCACTGTAGA 
GCGATTCGTG 



GGAGCGATTT 
GGAGCATATC 
TCTTAAACAA 
GAAATGGAAG 
CACTCCTATC 
CTGGTAATGT 
GTGGATGCCT 
GATGCCGGTT 
CAGGTAGGAT 
GTTGGTATCA 
TAGGAAAGAG 
TGAGAGGAAC 
CCAGGTTCTA 
TTCTAAAGAA 
CGCAATTCTA 
GAAGGCGTAG 
GTTGATTAGC 
AAGGCGGTAG 



TGGTTGTTTC 50 

TTATTGTCTC 100 

ACAAGACATG 150 

TGCGCGAATT 200 

GTAGCGGGTT 250 

GGGTGAATGG 300 

ATATCCCTAC 350 

GAAGATGTGT 4 00 

TGAAAGAGGC 4 50 

GACCTACACA 500 

TTGGAAAAAG 550 

TAAAAAAGAA 600 

TCACTCCGCA 650 

GAAGGCGGGA 700 

TGTGCGCACA 750 

AAATGGTTAT 800 

CCTGTTGCGT 850 

GACCGTTGGT 900 

906 



25 2) INFOEIMATION FOR SEQ ID NO: 1743 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 91 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus luteus 

(B) STRAIN: IFO 3333 

(C) ACCESSION NUMBER: M17788 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1743 



AACATGATCA 
CGCTACCGAC 
GCCAGGTCGG 

45 GTGGAGGACG 
GCTGTCCTCC 
CCGCTCTGAA 
GACCTCATGG 
GGACAAGCCG 

50 GTGGCACCGT 
TCCGAGGTCG 
CGGCATCGAG 
ACTGCGGTCT 



CCGGCGCCGC 
GGCCCGATGG 
CGTGCCGGCC 
AGGAGCTCCT 
AGGAGCTTCG 
GGCCCTCGAG 
ATGCCGTGGA 
TTCCTGATGC 
GGTGACCGGT 
AGATCGTCGG 
ATGTTCCACA 
GCTCGTGCGC 



TCAGATGGAC 
CCCAGACCCG 
CTGCTCGTGG 
CGAGCGTGTC 
ACGTCGACGA 
GGCGACCCCC 
CGAGTACATC 
CGATCGAGGA 
CGCGCCGAGC 
CATCCGCGAC 
AGCAGCTCGA 
GGTCTGAAGC 



GGCGCGATCC 
TGAGCACGTG 
CCCTGAACAA 
GAGATGGAGG 
GGCCCCGGTC 
AGTGGGTCAA 
CCGGACCCGG 
CGTCTTCACG 
GCGGCACCCT 
GTGCAGAAGA 
CGAGGCCTGG 
GCGACGACGT 



TCGTGGTCGC 50 

CTCCTGGCCC 100 

GTCGGACATG 150 

TCCGGCAGCT 200 

ATCCGCACCT 250 

GTCCGTCGAG 300 

TGCGCGACAA 350 

ATCACCGGCC 4 00 

GAAGATCAAC 4 50 

CCACTGTCAC 500 

GCCGGCGAGA 550 

CGAGCGCGGC 600 
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CAGGTGCTGG TGGAGCCGGG 

GAACGTCTAC ATCCTGTCCA 

ACTCGAACTA CCGCGCGCAG 

GTCATCACGC TGCCCGAGGG 

5 CGAGATGTCG GTCGAGCTCA 

GCTTCGCCAT CCGCGAGGGT 



CTCCATCACC CCGCACACCA 

AGGACGAGGG TGGGCGTCAC 

TTCTACTTCC GCACCACCGA 

CACCGAGATG GTCATGCCCG 

TCCAGCCGAT CGCCATGGAG 

GGCCGCACCG TGGGCTCCGG 



ACTTCGAGGC 650 

ACCCCGTTCT 700 

CGTCACCGGC 750 

GCGACACCAC 800 

GAGGGCCTCG 850 

C 891 



10 2) INFORMATION FOR SEQ ID NO: 1744 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



20 



25 



30 



35 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Erdmann- 

(C) ACCESSION NUMBER: X63539 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1744 



AACATGATCA 
CGCCACCGAC 
GTCAAGTGGG 
GTGGACGACG 
GCTGGCTGCC 
CGCTCAAGGC 
CTGATGAACG 
CAAGCCGTTC 
GAACCGTGGT 
GAAGTTGAGA 
CGGTGTGGAG 
ACGTTGGTTT 
CAGGTTGTCA 
CCAGGTCTAC 
TCAACAACTA 
GTGGTGACAC 
CAACATCTCG 
GTTTCGCGAT 



CCGGCGCCGC 
GGCCCGATGC 
TGTGCCCTAC 
AGGAGCTGCT 
CAGGAATTCG 
GCTCGAGGGT 
CGGTCGACGA 
CTGATGCCGG 
CACCGGACGT 
TCGTCGGCAT 
ATGTTCCGCA 
GCTGCTGCGG 
CCAAGCCCGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAGGG 
GTGAAGTTGA 
CCGCGAGGGT 



GCAGATGGAC 
CCCAGACCCG 
ATCCTGGTAG 
CGAACTCGTC 
ACGAGGACGC 
GACGCGAAGT 
GTCGATTCCG 
TCGAGGACGT 
GTGGAGCGCG 
TCGCCCATCG 
AGCTGCTCGA 
GGCGTCAAGC 
CACCACCACG 
AGGACGAGGG 
TTCTACTTCC 
CACCGAGATG 
TCCAGCCCGT 
GGCCGCACCG 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 
GCGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 
TGGGCGCCGG 



45 



TGGTGGTCGC 50 

CTGCTGGCGC 100 

GGCCGACGCA 150 

TCCGCGAGCT 200 

CGGGTCTCGG 250 

TGTCGAGGAA 300 

GCGAGACCGA 350 

ACCGGCCGCG 4 00 

CGTGAACGAG 4 50 

CCACCGTCAC 500 

GCGGGCGACA 550 

CGAGCGTGGC 600 

AGTTCGAAGG 650 

ACGCCGTTCT 7 00 

CGTGACCGGT 750 

GTGACAACAC 800 

GAAGGTCTGC 850 

C 891 



2) INFORMATION FOR SEQ ID NO: 1745 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

910 
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(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma genitalium 

(B) STRAIN: G37 

(C) ACCESSION NUMBER: U397 32 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1745 



AATATGATCA 
AGCAACTGAT 
GCCAAGTAGG 

15 GCTAGTGATG 
GTTAACTTCC 
CAGCTTTAAA 
GATTTGATTA 
AGATAAACCT 

20 GAGGTACAGT 
CAAGAAGTTG 
TGGAATTGAA 
ATGCTGGGGT 
CAAGTTTTAG 

25 TGAGATCTAT 
TAAACGGTTA 
TCTATTGCTT 
TTCTATTACT 
AGTTCTCAAT 

30 



CAGGTGCTGC 
AGTGTGATGC 
GGTTCCTAAA 
AAGAGGTACA 
TATGGTTTTG 
AGCATTGGAA 
AAGCAGTTGA 
TTCTTATTAG 
TGTTACAGGA 
AAATTGTTGG 
ATGTTCAAAA 
ATTATTACGT 
CAAAACCAGG 
GCTTTAAAGA 
CCGTCCTCAA 
TAGCTGAAAA 
GTTGAGTTAA 
TCGTGAAGGT 



ACAAATGGAT 
CCCAAACCCG 
ATGGTAGTTT 
AGAACTTGTT 
ATGGTAAGAA 
GGTGATCCAA 
TGAATGGATT 
CAATTGAAGA 
AGAGTTGAAA 
TTTAAAACCA 
AGGAACTTGA 
GGTGTTGAAC 
CTCTATTAAA 
AAGAAGAAGG 
TTCTATTTCC 
TACTGAAATG 
TTGCTCCTAT 
GGTAGAACTG 



GGAGCTATTC 
CGAGCACATC 
TTCTAAACAA 
GCTGAAGAAG 
CACTCCTATT 
AGTGGGAGGC 
CCAACTCCTA 
TACGATGACC 
GAGGTGAACT 
ATTAGAAAAG 
TTCAGCAATG 
GTAAAGAAGT 
CCGCACAAGA 
TGGTAGACAC 
GTACCACTGA 
GTTCTACCTG 
CGCTTGTGAA 
TAGGGGCAGG 



TAGTTGTTTC 50 

TTACTTGCCC 100 

GTGTGATATT 150 

TACGTGATCT 200 

ATTTATGGCT 250 

TAAGATCCAT 300 

CACGTGAAGT 350 

ATTACTGGTA 400 

CAAAGTAGGT 4 50 

CAGTTGTTAC 500 

GCTGGTGACA 550 

TGAAAGAGGT 600 

AATTTAAAGC 650 

ACTGGTTTTT 700 

TGTAACTGGT 7 50 

GTGATAATGC 800 

AAAGGTAGTA 850 

C 891 



2) INFORMATION FOR SEQ ID NO: 1746 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 
45 (B) STRAIN: MSll 

(C) ACCESSION NUMBER: L36380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1746 

50 AACATGATTA CCGGCGCCGC ACAAATGGAC GGTGCAATCC TGGTATGTTC 50 

TGCTGCCGAC GGCCCTATGC CGCAAACCCG CGAACACATC CTGCTGGCCC 100 

GTCAAGTAGG CGTACCTTAC ATCATCGTGT TCATGAACAA ATGCGACATG 150 

GTCGACGATG CCGAGCTGTT CCAACTGGTT GAAATGGAAA TCCGCGACCT 200 
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SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



10 



GCTGTCCAGC 
CCGCACTGAA 
GAACTGGCTA 
GGACAAACCA 
GCGGTACCGT 
GACGAGATTG 
CGGCGTTGAA 
ACGTAGGCGT 
CAGGTATTGG 
AGAAGTGTAC 
TCGCCAACTA 
ACGATTACTT 
AACCATTACT 
GCTTTGCGAT 



TACGACTTCC 
AGCCTTGGAA 
CCGCATTGGA 
TTCCTGCTGC 
AGTCACCGGC 
AAATCGTCGG 
ATGTTCCGCA 
ATTGCTGCGC 
CCAAACGGGG 
GTATTGAGCA 
CCGTCCCCAA 
TGGAAAAAGG 
GTAGAACTGA 
TCGCGAAGGC 



CCGGCGACGA 
GGCGATGCCG 
CAGATACATC 
CTATCGAAGA 
CGTGTAGAGC 
TCTGAAAGAA 
AACTGCTGGA 
GGTACCAAAC 
TACTATCACT 
AAGAAGAGGG 
TTCTACTTCC 
TGTGGAAATG 
TTGCGCCTAT 
GGCCGTACCG 



CTGCCCGATC 
CTTACGAAGA 
CCGACTCCCG 
CGTGTTCTCC 
GAGGTATCAT 
ACCCAAAAAA 
CGAAGGTCAG 
GTGAAGACGT 
CCTCACACCA 
CGGCCCCCAT 
GTACCACTGA 
GTAATGCCGG 
CGCTATGGAA 
TGGGTGCCGG 



GTACAAGGTT 250 

AAAAATCTTC 300 

AGCGTGCCGT 350 

ATTTCCGGCC 4 00 

CCACGTTGGT 450 

CCACCTGTAC 500 

GCGGGCGACA 550 

AGAACGCGGT 600 

AGTTCPJKAGC 650 

ACCCCGTTTT 700 

CGTAACCGGC 750 

GTGAGAACGT 800 

GAAGGTCTGC 850 

C 891 



15 



2) INFORMATION FOR SEQ ID NO: 1747 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rickettsia prowazekii 
30 (B) STRAIN: Madrid E 

(C) ACCESSION NUMBER: Z54170 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1747 



35 AATATGATAA 
TGCTGCTGAT 
AACAGGTAGG 
GTAGATGATC 
ATTATCAAAA 

4 0 CTGCACTTCA 
GAGTTAATGA 
AGATAAACCT 
GAGGTACCGT 
GAAGAAATTG 

45 AGGTGTAGAA 
ATGTCGGTAT 
CAAGTACTTG 
TGAAGTGTAT 
CTAATGATTA 

50 ACAATAAAAT 
TACTTTTTCA 
AATTCTCTAT 



CTGGTGCCGC 
GGTCCTATGC 
TGTACCTGCT 
CTGACCTATT 
TATGGTTTCC 
AGCTTTAGAA 
ATGCAGTAGA 
TTTTTAATGC 
TGTAACTGGT 
AT^TAGTAGG 
ATGTTCAGAA 
ATTACTACGT 
CAAAACCTGG 
GTGCTTAGTA 
TCGCCCACAG 
TGCCTTCTGA 
GTAGAATTAA 
ACGTGAAGGT 



TCAGATGGAT 
CTCAAACTAG 
ATGGTAGTAT 
AGAATTAGTT 
CTGGTAATGA 
GGAAT^CCTG 
TACGTATATA 
CAATAGAGGA 
AGAGTGGAGT 
TCTAAAAAAT 
AATTACTTGA 
GGTACAAAAA 
GAGCATAAAA 
AAGAGGAAGG 
TTCTATTTTA 
TAAGCAGATG 
TTAAGCCGAT 
GGTAGAACAG 



GGTGCTATAT 
AGAACATATA 
TTTTGAATAA 
GAGATGGAAG 
AATACCTATT 
AAGGTGAAAA 
CCTCAGCCTA 
TGTATTTTCT 
CAGGCATAAT 
ACGCAAAAAA 
TGAAGGACAA 
GAGAAGAAGT 
CCGCATGATA 
TGGACGTCAT 
GAACAACAGA 
GTTATGCCTG 
TGCTATGCAA 
TAGGAGCCGG 



TAGTAGTTTC 50 

TTACTGGCAA 100 

AGTAGATATG 150 

TAAGAGAATT 200 

ATTAAAGGTT 250 

AGCTATTAAT 300 

TAGAGCTACA 350 

ATTTCAGGCA 400 

TAAGGTGGGT 450 

CGACTTGTAC 500 

TCTGGAGATA 550 

AGAAAGAGGA 600 

7WVTTTGAAGC 650 

ACCCCATTTA 700 

TGTTACCGGC 750 

GAGATAATGC 800 

GAAGGGTTAA 850 

T 891 



912 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1748 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: LT2 trpE91 

(C) ACCESSION NUMBER: X55116 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1748 



AACATGATCA 

20 TGCGACTGAC 
GTCAGGTAGG 
GTTGATGACG 
GCTGTCTCAG 
CTGCTCTGAA 

25 GAACTGGCTG 
TGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
TGGCGTTGAA 

3 0 ACGTAGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 

35 CAAAATGGTT 
GTTTCGCAAT 



CCGGTGCTGC 
GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
GCTTCCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATTCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 
CCGTGAAGGC 



TCAGATGGAC 
CGCAGACCCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGACGCAG 
TTCTTATATT 
CGATCGAAGA 
CGTGTAGAGC 
TATCAAAGAG 
AACTGCTGGA 
GGTATCAAAC 
CACCATCAAG 
AAGATGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCCACCCGAT 
GGCCGTACCG 



GGCGCGATCC 
TGAGCACATC 
TCCTGAACAA 
GAGATGGAAG 
CACTCCGATC 
AGTGGGAAGC 
CCGGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACTCAGAAGT 
CGAAGGCCGT 
GTGAAGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACTACTGA 
GTAATGCCGG 
CGCGATGGAC 
TTGGCGCGGG 



TGGTTGTTGC 50 

CTGCTGGGTC 100 

ATGCGACATG 150 

TTCGCGAACT 200 

GTTCGTGGTT 250 

GAAAATCATC 300 

AGCGTGCGAT 350 

ATCTCCGGTC 400 

C/y^GTGGGC 450 

CTACCTGTAC 500 

GCCGGTGAGA 550 

CGAACGTGGT 600 

AGTTCGAATC 650 

ACTGCGTTCT 700 

CGTGACTGGT 7 50 

GCGACAACAT 800 

GACGGTCTGC 850 

C 891 



40 2) INFORMATION FOR SEQ ID NO: 1749 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefacxens 

(B) STRAIN: DSM 50426 



913 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1749 



ATGATCACTG 
AACAGACGGT 
5 AGGTTGGCGT 
GATGACGAAG 
ATCAGAATAC 
CTCTGAAAGC 
TTAGCAGCGG 

10 TAAGCCGTTC 
GTACAGTAGT 
GAAGTTGAAA 
TGTAGAAATG 
GTGGTATTTT 

15 GTATTAGCGA 
AGTTTACGTA 
AAGGCTACCG 
ATCGAACTGC 
GATGGTAGTG 

20 TCGCAATCCG 



GTGCTGCACA 
CCAATGCCAC 
ACCATTCATC 
AGCTGTTAGA 
GATTTCCCAG 
GCTAGAAGGC 
CGCTGGATTC 
CTACTGCCAA 
AACAGGTCGT 
TCGTTGGTGT 
TTCCGTAAAC 
GTTACGTGGT 
AGCCAGGTTC 
CTGTCAAAAG 
TCCACAGTTC 
CAGAAGGCGT 
ACACTGATTT 
TGAAGGCGGT 



GATGGACGGC 
AGACTCGTGA 
ATCGTATTCA 
GCTAGTTGAG 
GTGATGACTT 
GAGCCAGAGT 
TTACATTCCA 
TCGAAGACGT 
GTTGAGCGTG 
ACGTGCGACA 
TGCTTGACGA 
ACTAAGCGTG 
AATCAACCCA 
AAGAAGGTGG 
TACTTCCGTA 
AGAGATGGTA 
GCCCAATCGC 
CGTACAGTGG 



GCGATTCTGG 
GCACATCCTG 
TGAACAAATG 
ATGGAAGTGC 
ACCGGTAATC 
GGGAAGCAAA 
GAACCACAAC 
ATTCTCAATT 
GTATTGTACG 
ACTAAGACAA 
AGGTCGTGCA 
ATGACGTAGA 
CACACTACTT 
TCGTCACACG 
CAACTGACGT 
ATGCCAGGCG 
GATGGACGAA 
T 



TAGTCGCTTC 50 

CTTTCTCGTC 100 

TGACATGGTA 150 

GTGAACTGTT 200 

CAAGGTTCAG 250 

AATCCTTGAA 300 

GTGACATCGA 350 

TCAGGCCGTG 400 

CGTAGGCGAC 450 

CGTGTACTGG 500 

GGTGAGAACT 550 

ACGTGGTCAA 600 

TTGAATCAGA 650 

CCATTCTTCA 700 

AACCGGTACT 750 

ATAACATCAA 800 

GGTTTACGCT 8 50 

881 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1750 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Stigmatella aurantiaca 

(B) STRAIN: DW4 

(C) ACCESSION NUMBER: X82820 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1750 



40 



45 



50 



AACATGATCA 
CGCGGCCGAC 
GGCAGGTGGG 
CTGGACGATC 
GCTCAAGAAG 
GCGCGCTCAA 
ATCCTGAAGC 
TGCGACGGAC 
CAGGCCGAGG 
GTGGGCGAGG 
CATCACGGGG 
GAGACAACAT 
CGTGGGCAGG 



CGGGCGCGGC 
GGCCCGATGC 
CGTGCCCTAC 
CGGAGCTGCG 
TACGAGTTCC 
GGCGCTGGAG 
TGATGGCGGC 
AAGCCGTTCC 
AACGGTGGCG 
AAGTGGAGAT 
GTGGAGATGT 
CGGAGCGCTG 
TGCTGGCGAA 



GCAGATGGAC 
CCCAGACGCG 
ATCGTCGTCT 
CGAGCTGGTG 
CGGGCGACAG 
GGAGACACCA 
GGTGGACGAG 
TGATGCCGGT 
ACGGGCCGAG 
CGTGGGGATC 
TCCGCAAGCT 
CTGCGAGGCC 
CTGGGGGAGC 



GGAGCGATTC 
TGAGCACATC 
TCCTGAACAA 
GAGATGGAGG 
CATCCCCATC 
GCGACATCGG 
TACATCCCGA 
GGAAGACGTG 
TGGAGCGCGG 
CGTCCGACGC 
GCTGGACGAG 
TGAAGCGCGA 
ATCAACCCGC 



TGGTGGTGTC 50 

CTGCTGGCCA 100 

GGTGGACATG 150 

TGCGCGACCT 200 

ATCCCTGGCA 250 

CGAGGGAGCG 300 

CGCCGCAGCG 350 

TTCTCCATCG 400 

CAAGATCAAG 4 50 

AGAAGACGGT 500 

GGCATGGCGG 550 

GGACCTGGAG 600 

ACACGAAGTT 650 
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CAAGGCGCAG 
CGTTCTTCAA 
ACCGGAACGG 
CAACATCGCC 
AGCTGCCGTT 



GTGTACGTGC 

GGGATACCGG 
TGAAGCTGCC 
ATCGAGGTGG 
CGCCATCCGT 



TGTCGAAGGA 
CCGCAGTTCT 
GGACAACGTG 
AGCTCATTAC 
GAGGGTGGCC 



AGAGGGAGGG 
ACTTCCGGAC 
GAGATGGTGA 
TCCGGTCGCC 
GCACGGTGGG 



CGGCACACGC 700 

GACGGACGTG 750 

TGCCGGGAGA 800 

ATGGAGAAGG 850 

CGCCGGC 897 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1751 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Thiomonas cuprina 

(B) STRAIN: Hoe5 

(C) ACCESSION NUMBER: x76871 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1751 



25 



30 



35 



AACATGATCA 
CGCCGCCGAC 
GTCAGGTGGG 
GTCGACGACG 
GCTGTCCAAG 
CGGCCAAGCT 
ATTCTCAAGC 
GGCCGTCGAC 
CCGGGCGCGG 
GTCGGCGAGG 
CTGCACCGGC 
GCGACAACGT 
CGCGGCCAGG 
CACCGCCGAG 
CCTTCTTCAA 
ACCGGCGCCA 
TAATGTGAGC 
GCCTGCGCTT 



CCGGTGCGGC 
GGCCCCATGC 
CGTGCCCTAC 
CCGAGCTGCT 
TACGACTTCC 
GGCCCTCGAA 
TGGCCGAGGC 
GGCGCGTTCC 
CACGGTGGTC 
AAATCGAGAT 
GTGGAAATGT 
CGGCATCTTG 
TGCTGTGCAA 
GTGTACGTGC 
CAACTACCGC 
TCGAACTGCC 
ATCACCGTCA 
CGCCATCCGC 



CCAGATGGAC 
CCCAAACCCG 
ATCATCGTGT 
CGAACTCGTC 
CCGGTGACGA 
GGCGACAAGG 
CCTGGACACC 
TCATGCCCGT 
ACCGGGCGTG 
TGTCGGCCTC 
TCAGGAAGCT 
CTGCGCGGCA 
ACCCGGCTCG 
TGAGCAAGGA 
CCGCAGTTCT 
CAAGGACAAG 
AGCTCATCGC 
GAAGGCGGCC 



GGCGCCATCC 
CGAGCACATC 
TCCTCAACAA 
GAGATGGAAG 
CACCCCCATC 
GCGAACTGGG 
TACATCCCCA 
GGAAGACGTG 
TGGAGCGCGG 
AAGCCCACCC 
GCTCGACCAG 
CCAAGCGCGA 
ATCAAGCCCC 
CGAGGGCGGC 
ACTTCCGCAC 
GAAATGGTCA 
CCCCATCGCC 
GCACCGTCGG 



TGGTCGTGTC 50 

CTGCTGGCGC 100 

GTGCGACATG 150 

TGCGCGAGCT 200 

ATCAAGGGCT 250 

CGAAGGCGCC 300 

CGCCCGAGCG 350 

TTCTCCATCT 400 

CATCATCAAG 450 

TCAAGACCAC 500 

GGCCAGGCCG 550 

GGAAGTCGAG 600 

ACACCCACTT 650 

CGCCACACCC 700 

CACCGACGTC 750 

TGCCCGGCGA 800 

ATGGAAGAAG 850 

CGCCGGC 897 



i5 



2) INFORMATION FOR SEQ ID NO: 1752 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



915 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: AE001202 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1752 



10 



15 



20 



25 



AATATGATCA 
TGCGCCTGAC 
GTCAGGTTGG 
GTTGATGATC 
GCTTGCTGGA 
CGTTTAAAGC 
GAGGAACTGC 
CGACGCAAGA 
GGCGTGGTAC 
AATGAAGAGG 
TACTGGCATT 
ATAACGTGGG 
GGTCAGGTGC 
GGCGCAGATC 
TTTTTCAAGG 
GGTACGATTT 
CACCAAGATT 
TGAAGCTTGC 



CGGGTGCTGC 
GGCGTTATGC 
TGTTCCCTCC 
CTGAGTTGCT 
TATGGGTTTT 
TCTGCAGGAT 
TTGCGGCCAT 
CCTTTCTTGC 
CGTTGTCACG 
TCGAGATCGT 
GAGATGTTTA 
GCTGCTTTTG 
TTTCTAAGCC 
TACGTGCTCT 
TTATCGTCCG 
CTCTTCCTGA 
ATAGGTGAGC 
GATTCGTGAA 



GCAGATGGAC 
CACAGACGAA 
ATCATTGTTT 
AGAGCTGGTG 
CGCGTGAGAC 
GGCGCTTCCC 
GGATTCCTAC 
TCTCTATCGA 
GGGCGCATCG 
CGGGATTAAG 
ATAAGTTGCT 
CGCGGGGTGG 
CGGTTCTATT 
CTAAGGAAGA 
CAGTTTTATT 
AGGGGTAGAC 
TCATCCACCC 
GGGGGGCGCA 



GGTGGTATTC 
GGAGCATCTT 
TTTTGAACAA 
GAAGAAGAGG 
GCCTATCGTC 
CGGAGGATGC 
TTTGAAGACC 
GGATGTGTAC 
AATGTGGGGT 
CCCACTAAGA 
TGATCAGGGA 
ATAAAAAAGA 
AAGCCACACA 
GGGTGGCCGT 
TTAGAACTAC 
ATGGTGAAGC 
GATAGCTATG 
CTATTGCTTC 



TCGTCGTGTC 50 

CTGCTCGCCC 100 

GGTTGATTTG 150 

TGCGTGATGC 200 

AAGGGGTCTG 250 

AGCTTGTATT 300 

CAGTGCGTGA 350 

ACTATTTCTG 400 

AATTAGTCTG 4 50 

AAACAGTGGT 500 

ATTGCAGGTG 550 

GGTTGAGCGC 600 

CCAAGTTTGA 650 

CACAGTCCTT 700 

TGACATTACC 750 

CGGGGGATAA 800 

GACAAGGGTC 850 

TGGT 894 



30 



2) INFORMATION FOR SEQ ID NO: 1753 

{i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ureaplasma urealyticum 

(B) STRAIN: ATCC 33697 

(C) ACCESSION NUMBER: Z34275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1753 



AATATGATTA 
50 TGCATCTGAT 
GTCAAGTTGG 
ATGACAGATC 
ATTATCTAAA 



CAGGGGCAGC 
GGGGTTATGG 
TGTTCCAAAA 
CAGATATGCA 
TATGGATTTG 



ACAAATGGAT GGAGCAATTT 

CTCAAACTAA AGAACATATT 

ATCGTTGTTT TCTTAAACAA 

AGATCTTGTT GAAATGGAAG 

ATGGCGATAA CACACCAGTT 



TAGTTATTGC 50 

TTATTAGCAC 100 

ATGTGATTTC 150 

TTCGTGAATT 200 

ATTCGTGGTT 250 
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CAGGTCTTAA 
GAATTAATGG 
TGACAAACCA 
GTGGTACAGT 
GATGAGGTTG 
AGGAATTGAA 
ATGCTGGTAT 
CAAGTACTTG 
TAAAGTTTAT 
TTTCAGGATA 
GCTATTTCAT 
TGAAATGACT 
AATTCTCAAT 



GGCTTTAGAA 
ACGCAGTTGA 
TTCTTATTAG 
AGTAACTGGA 
AAATTGTTGG 
ATGTTTAGAA 
TTTATTACGT 
TAAAACCAGG 
ATTCTTAAAA 
CCGTCCACAA 
TACCTGCTGG 
GTAGAATTAA 
CCGTGAAGGT 



GGAGATCCAG 
TTCATGAATT 
CAATTGAAGA 
CGTGTTGAAC 
TCTAAAAGAC 
AATCATTAGA 
GGTATTAAAA 
ATCAATTAAA 
AAGAAGAAGG 
TTCTATTTTA 
TGTTGATTTG 
TTGCTCCAGT 
GGT7\AAACTG 



TTTGAGAAGC 
CCATTACCAG 
TGTATTCACA 
GTGGTGTATT 
ACTCAAAAAA 
TCAAGCTGAA 
AAGAAGATGT 
CCTCACCGTA 
TGGACGTCAT 
GAACAACAGA 
GTTATGCCAG 
TGCGATTGAA 
TAGGTCATGG 



AAAAATTGAT 300 

AACGTAGTAC 350 

ATTTCAGGAC 400 

AAAAGTTAAT 450 

CTGTTGTTAC 500 

GCTGGTGATA 550 

TGAACGTGGT 600 

CTTTTACTGC 650 

ACACCTATTG 700 

TGTAACAGGT 750 

GTGATGACGT 800 

GATGGATCTA 850 

T 891 



15 



2) INFORMATION FOR SEQ ID NO: 1754 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 909 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wolinslla succlnogenes 

(B) STRAIN: DSM 1740 

30. (C) ACCESSION NUMBER: X76862 

(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 1754 



AACATGATTA 

35 TGCGGCGGAT 
GACAAGTAGG 
GTTGATGACG 
TCTTAGCAAC 
CCGCTCTTAA 

40 TGGGGCGAGA 
TACGCCTGAG 
TATTCTCCAT 
GGCGTGGTTA 
ACAAAAAACA 

45 AGGGTGAGGC 
GAAGATGTTG 
TCACACTAAC 
GACGACACAC 
ACTACAGACG 

50 TATGCCTGGT 
CCCTCGAAGA 
GGTGCGGGT 



CAGGTGCTGC 
GGCCCCATGC 
CGTTCCTTAC 
CTGAGCTTCT 
TACGACTTCC 
AGCTCTTGAA 
AAGTATTGAA 
CGAGATGTGG 
CGCGGGTCGT 
AAGTCGGTGA 
ACCGTAACTG 
GGGTGACAAC 
AGAGAGGTAT 
TTTGAAGGTG 
TCCATTCTTC 
TTACCGGTTC 
GACAACGTTA 
GGGAACACGA 



TCATVATGGAT 
CCCAAACTAG 
ATCGTGGTTT 
TGAGCTTGTT 
CTGGAGATGA 
GAGGCTAACG 
GCTTATGGCT 
ATAAGCCTTT 
GGAACCGTTG 
CGAAGTAGAA 
GCGTTGAGAT 
GTTGGTGTTC 
GGTTCTTTGT 
AAGTTTACGT 
AATGGATACC 
TATCTCTCTT 
AGATCAATGT 
TTCGCGATCC 



GGCGCGATTC 
GGAGCACATT 
TCTTGAACAA 
GAAATGGAAG 
CACTCCTATC 
ACCAGGAAAA 
GAGGTTGACC 
CCTTATGCCT 
TGACAGGAAG 
ATCGTTGGTA 
GTTCCGAAAA 
TTTTGAGAGG 
AAAATAGGTT 
TCTTTCCAAA 
GACCTCAGTT 
CCTGAGGGCG 
TGAGCTTATC 
GTGAAGGTGG 



TTGTTGTTTC 50 

CTTCTTTCTC 100 

AGAAGATATG 150 

TTAGAGAACT 200 

GTTGCAGGTT 250 

TGTTGGCGAG 300 

GATATATTCC 350 

GTTGAAGACG 4 00 

AATTGAAAGA 450 

TCCGAAACAC 500 

GAGCTCGACA 550 

CACCAAGAAA 600 

CTATCACTCC 650 

GAGGAAGGCG 700 

CTATGTTAGA 750 

TAGAGATGGT 800 

GCTCCTGTAG 850 

TCGAACCGTT 900 

909 
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2) INFORMATION FOR SEQ ID NO: 1755 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia cepacia 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1755 



PCT/CAOO/01150 



15 



20 



25 



30 



ATGATCACGG 
AGCAGACGGC 
AGGTTGGTGT 
GACGACGCTG 
GTCGAAGTAC 
CCAAGCTGGC 
ATGAGCCTGG 
AGTTGACGGC 
GCCGTGGTAC 
GGCGAAGAAA 
CACGGGCGTT 
ACAACGTCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCAACAA 
GGCTCGATCG 
CGTGTCGATC 
TGCGCTTCGC 



GCGCAGCGCA 
CCGATGCCGC 
TCCGTACATC 
AACTGCTCGA 
GACTTCCCGG 
GCTGGAAGGC 
CAGACGCGCT 
GCGTTCCTGA 
GGTGGTGACG 
TCGAAATCGT 
GAAATGTTCC 
TATCCTGCTG 
TGGCGAAGCC 
TACGTGCTGA 
CTACCGTCCG 
AGCTGCCGAA 
ACGGTGAAGC 
AATCCGTGAA 



GATGGACGGC 
AAACGCGTGA 
ATCGTGTTCC 
GCTGGTCGAG 
GCGACGACAC 
GACACGGGCG 
GGACACGTAC 
TGCCGGTGGA 
GGTCGTGTCG 
CGGTATCAAG 
GC/U^GCTGCT 
CGCGGCACGA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGACAAGGAA 
TGATTGCTCC 
GGCGGCCGTA 



GCGATCCTGG 
GCACATCCTG 
TGAACAAGTG 
ATGGAAGTTC 
GCCGATCGTG 
AGCTGGGCGA 
ATCCCGACGC 
AGACGTGTTC 
AGCGCGGCAT 
CCGACGGTGA 
GGACCAAGGT 
AGCGTGAAGA 
ACGCCGCACA 
AGGCGGCCGT 
TCCGTACGAC 
ATGGTGATGC 
GATCGCGATG 
CGGTCGGC 



TTTGCTCGGC 50 

CTGGCGCGTC 100 

CGACAGTGTG 150 

GCGAACTCCT 200 

AAGGGTTCGG 250 

AGTGGCGATC 300 

CGGAGCGTGC 350 

TCGATCTCGG 400 

CGTGAAGGTC 450 

AGACGACCTG 500 

CAGGCAGGCG 550 

CGTGGAGCGT 600 

CGCACTTCAC 650 

CACACGCCGT 700 

GGACGTGACG 750 

CGGGCGACAA 800 

GAAGAAGGTC 850 

888 



35 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1756 

TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 

918 
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AGCGTCAAGG TGGACAAGCA 

CCTGTATATG CACAAAAACT 

ACAACCTGAT ACAGGGGAGC 

GAAGTGGTGC GGTTGATATT 

5 CCGAAAGCTG AAATTGAAGG 

AGCTCGTCTA ATGTCTCAAG 

AATCAAAAAC AATCGCAATC 



GCGTTCATTG ATGCTGAGCA 
AGGTGTTAAC ATCGATGAAT 
AAGGTTTAGA AATCGCAGAA 
ATCGTAATTG ACTCTGTAGC 
AGACATGGGT GACTCACACG 
CACTTCGTAA ACTTTCAGGT 
TTTATTAACC AAATTCGT 



PCT/CAOO/01150 

TGCAATGGAT 100 

TACTATTATC 150 

GCACTTGTAC 200 

AGCTCTTGTA 250 

TAGGTTTACA 300 

GCAATCAATA 350 

388 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1757 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1757 

TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 50 

AGCGTCAAGG TGGACAAGCA GCGTTCATTG ATGCTGAGCA TGCAATGGAT 100 

CCTGTATATG CACAAAAACT AGGTGTTAAC ATCGATGAAT TACTATTATC 150 

3 0 ACAACCTGAT ACAGGGGAGC AAGGTTTAGA AATCGCAGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGATATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCTG AAATTGAAGG AGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCTCGTCTA ATGTCTCAAG CACTTCGTAA ACTTTCAGGT GCAATCAATA 350 

AATCAAAAAC AATCGCAATC TTTATTAACC AAATTCGT 388 

35 



2) INFORMATION FOR SEQ ID NO: 1758 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
50 (B) STRAIN: ATCC 7064 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1758 

919 
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TGAAAGTTCA GGTAAAACAA 
AGCGTCAAGG TGGACAAGCA 
CCTGTATATG CACAAAAACT 
5 ACAACCTGAT ACAGGGGAGC 
GAAGTGGTGC GGTTGATATT 
CCGAAAGCTG AAATTGAAGG 
AGCTCGTCTA ATGTCTCAAG 
AATCAAAAAC AATCGCAATC 



CAGTTTCATT ACACGCAATT 
GCGTTCATTG ATGCTGAGCA 
AGGTGTTAAC ATCGATGAAT 
AAGGTTTAGA AATCGCAGAA 
ATCGTAATTG ACTCTGTAGC 
AGACATGGGT GACTCACACG 
CACTTCGTAA ACTTTCAGGT 
TTTATTAACC AAATTCGT 



GCAGAAGTAC 50 

TGCAATGGAT 100 

TACTATTATC 150 

GCACTTGTAC 200 

AGCTCTTGTA '250 

TAGGTTTACA 300 

GCAATCAATA 350 

388 



2) INFORMATION FOR SEQ ID NO: 1759 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



40 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
25 (B) STRAIN: ATCC 13472 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1759 

3 0 TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATCG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTAGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

35 CCGAAAGCAG AGATTGAAGG TGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1760 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 374 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus my co ides 

(B) STRAIN: ATCC 64 62 
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(xi) SEQUENCE DESCRIPTION: SEQ 

AAACGACAGT TTCATTACAT GCAATTGCAG 

5 CAAGCAGCAT TCATCGATGC GGAGCACGCA 

AAAATTAGGC GTTAACATAG ATGAATTACT 

GGGAGCAAGG ATTAGAAATC GCAGAAGCAC 

GACATTATCG TAATTGACTC TGTAGCAGCT 

TGAAGGAGAC ATGGGTGACT CACACGTAGG 

10 CACAAGCACT TCGTAAGCTT TCAGGAGCAA 

GCAATCTTTA TTAACCAAAT TCGT 



ID NO: 1760 

AAGTACAACG TCAAGGTGGA 50 

ATGGATCCTG TATATGCACA 100 

ATTATCACAG CCTGATACAG 150 

TTGTACGAAG TGGTGCGGTT 200 

CTTGTACCGA AAGCAGAGAT 250 

TTTACAAGCA CGTTTAATGT 300 

TCAACAAATC AAAAACAATT 350 

374 



15 2) INFORMATION FOR SEQ ID NO: 1761 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) X TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoid.es 

(B) STRAIN: NRRL BD-10 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 61 



GGAAAGTTCA GGTAAAACAA CGGTTTCCTT ACATGCGATT GCAGAAGTGC 50 

AACGTCAAGG TGGACAAGCG GCATTTATTG ATGCGGAGCA TGCGATGGAT 100 

CCTGTATATG CACAAAAGTT AGGTGTTAAT ATTGATGAGT TACTATTATC 150 

GCAGCCTGAT ACAGGAGAAC AAGGTTTAGA AATCGCAGAA GCATTAGTAC 200 

35 GAAGCGGTGC GATTGATATC ATTGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCAAAAGCAG AAATCGAAGG GGAAATGGGT GACTCCCACG TTGGTTTACA 300 

AGCGCGTTTA ATGTCACAAG CACTTCGTAA GCTTTCTGGT GCGATTAACA 350 

AATCAAAAAC AATTGCAATC TTCATTAACC A 381 



2) INFORMATION FOR SEQ ID NO: 1762 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1762 

5 TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATCG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC 7VAGGATTAGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

10 CCGAAAGCAG AGATTGAAGG CGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAT^C AATTGCAATC TTTATTAACC AAATTCGT 388 



15 



25 



2) INFORMATION FOR SEQ ID NO: 1763 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1763 

TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATTG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

35 ACAGCCTGAT ACAGGGGAGC AAGGATTGGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCAG AGATTGAAGG CGATATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 

40 



2) INFORMATION FOR SEQ ID NO: 1764 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 334 96 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1764 



CTCCTATCTG GATTATGCGA 
ATGTCCGAGA TGGCCTGAAG 
AACGTATTGG GCAATGACTG 
CGTGGGTGAC GTCATCGGTA 
ACGACACCAT TGTACGTATG 
GTAGATGGCC AGGGTAACTT 
GATGCGTTAT ACGGAAATCC 
CCGACCTC 



TGTCGGTCAT TGTTGGCCGT GCGCTGCCGG 
CCGGTACACC GTCGCGTACT ATACGCCATG 
GAACAAAGCC TATAAAAAAT CTGCCCGTGT 
AATACCACCC TCATGGTGAT ACTGCCGTAT 
GCGCAGCCCT TCTCCCTGCG TTACATGCTG 
TGGTTCGGTC GACGGCGACT CCGCCGCAGC 
GTATGTCGAA GATCGCCCAT GAACTGATGG 



50 
100 
150 
200 
250 
300 
350 
358 



2) INFORMATION FOR SEQ ID NO: 1765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1765 

TTAAGAACTC TTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG 50 

CTGCCGGATG TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTTTA 100 

CGCCATGAAC GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCAG 150 

CCCGTGTCGT TGGTGACGTA ATCGGTAAAT ACCACCCGCA CGGCGACTCC 200 

GCGGTATACG ACACCATCGT GCGTATGGCG CAGCCGTTCT CGCTGCGTTA 250 

CATGCTGGTG GACGGCCAGG GTAACTTTGG TTCCATCGAC GGCGACTCCG 300 

CCGCGGCGAT GCGTTATACC GAAATTCGTC TGGCGAAAAT CGCTCATGAG 350 

CTGATGGCCG ATCTT 365 



2) INFORMATION FOR SEQ ID NO: 1766 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1766 

AAGAGCTCGT ATCTGGATTA TGCGATGTCG GTCATTGTTG GCCGTGCGCT 50 

GCCGGATGTC CGAGATGGCC TGAAACCGGT ACACCGTCGC GTACTTTACG 100 

CCATGAACGT ATTGGGCAAT GACTGGAACA AAGCCTATAA AAAATCCGCC 150 

10 CGTGTCGTTG GTGACGTAAT CGGTAAATAC CACCCTCATG GTGATACCGC 200 
CGTTTATGAC ACCATTGTAC GTATGGCACA GCCATTCTCC TTGCGTTATA 250 

TGCTGGTCGA TGGCCAGGGT AACTTCGGTT CTGTCGATGG CGACTCCGCC 300 

GCAGCGATGC GTTATACGGA AATCCGTATG TCGAAAATCG CCCA 344 



15 



30 



2) INFORMATION FOR SEQ ID NO: 1767 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 345 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1767 



AACTCTTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

GGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

35 TGAACGTATT GGGCAATGAC TGGAACAAAG CCTATAAAAA ATCAGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAC CCGCACGGCG ACTCCGCGGT 200 

ATACGACACC ATCGTGCGTA TGGCGCAGCC GTTCTCGCTG CGTTACATGC 250 

TGGTGGACGG CCAGGGTAAC TTTGGTTCCA TCGACGGCGA CTCCGCCGCG 300 

GCGATGCGTT ATACCGAAAT TCGTCTGGCG AAAATCGCTC ATGAG 345 

40 



2) INFORMATION FOR SEQ ID NO: 1768 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

924 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 68 

5 

CTTATCTGGA TTATGCGATG TCGGTCATTG TTGGCCGTGC GCTGCCGGAT 50 

GTCCGAGATG GCCTGAAGCC GGTACACCGT CGCGTACTTT ACGCCATGAA 100 

CGTATTGGGC AATGACTGGA ACAAAGCCTA TAAAAAATCA GCCCGTGTCG 150 

TTGGTGACGT AATCGGTAAA TACCACCCGC ACGGCGACTC CGCGGTATAC 200 

10 GACACCATCG TGCGTATGGC GCAGCCGTTC TCGCTGCGTT ACATGCTGGT 250 

GGACGGCCAG GGTAACTTTG GTTCCATCGA CGGCGACTCC GCCGCGGCGA 300 

TGCGTTATAC CGAAATTCGT CTGGCGAAAA TCGCTCATGA GCTGATGGCC 350 

GATCTT 356 

15 

2) INFORMATION FOR SEQ ID NO: 1769 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 3 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1769 



TAAGAACTCT TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 50 

TGCCGGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 100 

35 GCCATGAACG TATTGGGCAA TGACTGGAAC AAAGCCTATA AAAAATCAGC 150 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCACCCGCAC GGCGACTCCG 200 

CGGTATACGA CACCATCGTG CGTATGGCGC AGCCGTTCTC GCTGCGTTAC 250 

ATGCTGGTGG ACGGCCAGGG TAACTTTGGT TCCATCGACG GCGACTCCGC 300 

CGCGGCGATG CGTTATACCG AAATTCGTCT GGCGAAAATC GCTCATGAGC 350 

40 TGATGGCCGA T 361 



2) INFORMATION FOR SEQ ID NO: 1770 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STE^IN: ATCC 13824 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1770 

TAAGAACTCT TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 50 

TGCCGGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 100 

10 GCCATGAACG TATTGGGCAA TGACTGGAAC AAAGCCTATA AAAAATCAGC 150 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCACCCGCAC GGCGACTCCG 200 

CGGTATACGA CACCATCGTG CGTATGGCGC AGCCGTTCTC GCTGCGTTAC 250 

ATGCTGGTGG ACGGCCAGGG TAACTTTGGT TCCATCGACG GCGATTCCGC 300 

CGCGGCGATG CGTTATACCG AAATTCGTCT GGCGAAAATC GCTCATGAGC 350 

15 TGATGGCCGA TCTTG 365 



2) INFORMATION FOR SEQ ID NO: 1771 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Klebsiella terrigena 

(B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1771 

35 TCGTATCTGG ATTATGCGAT GTCGGTCATT GTTGGCCGTG CGCTGCCGGA 50 

TGTCCGAGAT GGGTTGAAAC CGGTACACCG TCGCGTACTT TACGCCATGA 100 

ACGTATTGGG CT^TGACTGG AACAAAGCCT ATAAAAAATC CGCCCGTGTC 150 

GTTGGTGACG TAATCGGTAA ATATCACCCT CACGGTGATA CCGCCGTTTA 200 

TGACACCATT GTACGTATGG CGCAGCCATT CTCCTTGCGT TATATGCTGG 250 

40 TCGATGGCCA GGGTAACTTC GGTTCTGTCG ATGGCGACTC CGCCGCAGCG 300 

ATGCGTTATA CGGAAATCCG TATGTCGAAA ATCGCCCACG AGCTGATGGC 350 

CGACCTC 357 



45 

2) INFORMATION FOR SEQ ID NO: 1772 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 968 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 
5 (B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1772 

CAGTACACAG GCGAGTTCTT TTTGCGATGA GTGAGTTAAG TAATGATTGG 50 

10 AATAAGCCGT ATAAAAAATC TGCTCGTGTA GTAGGGGATG TCATTGGTAA 100 

ATATCATCCT CATGGGGATA CAGCTGTTTA TGACACTATT GTTCGTATGG 150 

CTCAGCCCTT TTCCATGCGT TATATGCTGA TTGATGGGCA GGGTAATTTT 200 

GGCTCTGTAG ATGGAGATGC TCCAGCTGCC ATGCGTTACA CTGAAGTAAG 250 

AATGTCCAAA GTGGCGCATG CTTTACTGGC TGATTTGGAT AAGGAT^CCG 300 

15 TTGATTTTAG TCCTAACTAT GATGAAACAG AATTTGCTCC AGTGGTATTG 350 

CCATCGAGAA TTCCCAATTT ACTAGTTAAT GGCTCTTCCG GTATTGCGGT 400 

AGGGATGGCT ACTAATATTC CACCACATAA TCTTACCGAA GTAATCAATG 4 50 

CATGTATTGC TTTAGTGGAT GAACCTGACA CGAGTCTTGA AGATTTAATG 500 

GAAATTATTC CTGGCCCTGA TTTTCCTACA GCCGCAATTA TTAATGGTCG 550 

20 TGCTGGAATT ATTGAAGGTT ATCGTACTGG AAAAGGGCGG GTTGTTATCA 600 

GGGCACGCAC AGAAATTGAA ACGGATGAAA GTTCAGGCCG TCAGTCAATT 650 

ATTATTCAGG AATTACCCTA TCAGGTGAAT AAAGCGCGTT TGATCGAGCG 700 

TATTGCTGAA TTGGTAAGGG ACAAGAAAAT CGAAGGAATT TCCGGCTTGA 750 

GAGATGAGTC AGACAAGCAA GGAATGAGAG TAGTCATTGA ATTAAAACGC 800 

25 AATGAAGTAG CAGATGTGGT ATTGAATAAC CTGTTCGCTC ATACTCAAAT 850 

GCAAAATGTA TTCGGAATTA ATATGGTTGC TCTGGTGGAT GGCCAACCGC 900 

GTACTTTGAA TTTGAAGCAA ATACTGGAAT ATTTTATAAA ACATCGAAGA 950 

GAGGTTGTTA CCAGACGC , 968 



30 



45 



2) INFORMATION FOR SEQ ID NO: 1773 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1773 



CACCGCCGAG TGCTATTTGC GATGAATGTA CTGGGAAACG ATTGGAATAA ' 50 

ACCTTATAAA AAATCAGCCC GTGTTGTTGG GGATGTAATC GGTAAATATC 100 

50 ACCCGCACGG TGACAGTGCT GTCTATGAAA CGATTGTTCG TTTAGCACAG 150 

CCTTTTTCTA TGCGCTACAT GTTGGTTGAC GGTCAGGGTA ACTTCGGGTC 200 

AGTTGATGGT GACTCCGCGG CGGCTATGCG TTATACCGAA GTTCGTATGG 250 

CGAAGATCGC CCATGAACTG CTGGCGGATT TGGAAAT^GA GACGGTCGAC 300 
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TTTGTTCCTA ACTATGATGG AACAGAAAAT 
CCGTATTCCA AACTTGTTAG TTAATGGCTC 
TGGCAACGAA TATCCCTCCG CATAACCTCG 
CTTGCCTATG TTGATAATGA AGACATCACC 
5 TATTACCGGG CCTGATTTTC CGACTGCTGC 
GAATATTAGA TGCTTATCGT ACAGGGCGTG 
CAGGCTGATA TTGAAACTGA TGAGAAAACA 
GACAGAAATT CCTTATCAGG TGAATAAAGC 
CGGAGCTTGT AAAAGATAAA CGTATTGAAG 
10 GAGTCTGATA AAGACGGTAT GCGTATTGTT 
AGTCGGTGAA GTAGTATTAA ATCACCTCTT 
TCTCTTTTGG TATTAATATG GTTGCGCTTC 
TTGAACCTAA AAGAAATTAT CGCAGCCTTT 
GGTGACTCGC CGTACCA 

15 



ATACCGGCTG TTATGCCAAC 350 

TTCAGGTATT GCCGTTGGGA 4 00 

GTGAAGTTAT CGACGGTTGT 4 50 

ATAGAAGAAT TAATGGAATA 500 

GATTATTAAT GGTCGCAGAG 550 

GAAAGATTTA TATCCGTGCT 600 

GGTCGCGAAA CCATTATCGT 650 

CCGTTTAATT GAAAAAATTG 700 

GTATCAGCGG ATTACGTGAC 750 

GTTGAGATCA AACGTGATGC 800 

TTCACAAACT CAAATGCAAG 850 

ATC7U\GGCCA ACCAAAATTA 900 

ATTCGCCATC GTCGTGAAGT 950 

967 



2) INFORMATION FOR SEQ ID NO: 1774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 
30 (B) STRAIN: ATCC 9250 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1774 

CCAGTACACC GCAGAGTATT GTATGCGATG AATGTATTGG GAAATGATTG 50 

35 GAATAAACCC TATAAAAAAT CTGCCCGTAT TGTTGGGGAC GTCATCGGTA 100 

AATACCATCC ACATGGTGAT AGCGCTGTTT ACGAGACAAT CGTTCGTCTT 150 

GCTCAGCCTT TCTCAATGCG TTACATGCTG GTTGATGGTC AGGGAAACTT 200 

CGGTTCTGTT GACGGAGACT CCGCAGCGGC AATGCGTTAT ACGGAAATCC 250 

GTATGGCGAA AATTGCCCAT GAACTACTTG CTGACCTTGA AAAAGAAACC 300 

4 0 GTTGATTTCG TTCCTAACTA TGACGGAACA GAGCACATTC CTGAAGTTAT 350 

GCCAACGAAA ATCCCAAACC TTTTGGTTAA TGGGTCGTCA GGTATTGCTG 400 

TTGGGATGGC AACCAATATC CCACCTCACA ATTTAGGGGA GGTGATTAAT 450 

GGTTGTCTTG CCTATATAGA AGACGAAGAC ATCAGCATTG ATGGTTTAAT 500 

GGT^CACATT CCAGGGCCTG ATTTCCCAAC CGCAGCTATT ATTAATGGCC 550 

45 GTCGTGGGAT TATTGATGCG TATCGCACAG GGCGTGGCAA GGTCTATATC 600 

CGTGCAAGCG CTGAAGTGGA AGTCGATGAG AAAAATGGTC GCGAAACCAT 650 

TATTGTCAGC GAAATTCCTT ATCAAGTGAA TAAAGCTCGC TTGATTGAAA 700 

AAATTGCTGA GTTAGTTAAA GACAAGCGTG TTGAAGGTAT CAGTGCACTG 750 

CGTGACGAGT CTGATAAAGA CGGTATGCGT ATTGTTATTG AAATCAAACG 800 

50 CGATGCGGTG GGTGAAGTTG TACTGAACAA CTTATATTCC CTGACCCAAT 850 

TGCAAGTTTC TTTTGGTATC AATATGGTAG CTCTACACCA AGGGCAGCCG 900 

AAAATACTGA ATTTAAAAGA TATCATTGCT GCTTTTGTGC GTCACCGCCG 950 

TGAAGTCGTC ACTCGTCGTA CGATTTTC 978 

928 
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2) INFORMATION FOR SEQ ID NO: 1775 



SEQUENCE CHARACTERI ST ICS : 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 



15 



(xi) 



SEQUENCE DESCRIPTION: SEQ 



ID NO: 1775 



CCAGTACACC GTCGCGTTCT TTTCGCGATG AATGTATTAG GAAACGATTG 
GAATAAACCT TATAAAAAAT CAGCCCGTGT TGTTGGGGAT GTTATCGGTA 
AATATCACCC GCACGGTGAC AGTGCTGTTT ATGAAACGAT AGTTCGTTTA 

20 GCACAGCCTT TTTCTATGCG TTACATGTTG GTTGACGGGC AGGGTAACTT 
CGGGTCAGTT GATGGTGACT CGGCGGCTGC AATGCGTTAT ACCGAAGTTC 
GTATGGCGAA AATCGCCCAT GAACTGCTGG CGGATTTGGA AAAAGAAACG 
GTTGATTTTG TTCCTAACTA TGATGGAACA GAGCATATCC CGGCAGTCAT 
GCCAACCCGT ATTCCAAACT TATTAGTCAA TGGTTCTTCA GGTATCGCAG 

25 TCGGGATGGC AACAAACATT CCTCCGCATA ACCTAGGTGA AGTTATCGAC 
GGCTGTCTTG CTTATGTTGA TAACGAAGAC ATCACTATTG AAGAGTTGAT 
GGAGCATATC ACGGGGCCTG ATTTCCCAAC TGCCGCTATT ATTAATGGCC 
GCAGAGGAAT TTTAGATGCT TACCGTACTG GGCGCGGAAA AATTTATATT 
CGTGCACAAG CTGATGTAGA AACCGATGAG AAAACTGGTC GCGAAACAAT 

30 TATCGTGACG GAAATTCCTT ATCAGGTGAA CAAAGCTCGC TTAATTGAAA 
AAATTGCAGA GCTTGTTAAA GATAAACGTA TTGAAGGCAT TAGCGGATTA 
CGTGATGAGT CAGATAAAGA TGGTATGCGC ATTGTTGTTG AAATTAAGCG 
TGATGCTGTT GGTGAAGTTG TACTAAATCA CTTATTTTCT CAGACTCAGA 
TGCAGGTTTC TTTTGGTATT AACATGGTTG CACTGCATCA AGGTCAACCG 

35 AAAGTGTTAA ACCTGAAAGA AATTATTTCA GCCTTTATTC GTCACCGTCG 
TGAAGTGGTG ACTCGTCGTA CTATTTTT 



40 2) INFORMATION FOR SEQ ID NO: 1776 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
978 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) 



MOLECULE TYPE: Genomic DNA 



50 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 



929 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1776 

TGGGTAATGA CTGGAATAAG CCATACAAAA AATCGGCCCG TGTAGTCGGG 50 

GACGTTATCG GTAAATATCA CCCGCATGGT GACAGCGCGG TCTACGACAC 100 

5 AATTGTGCGT ATGGCCCAGC CGTTCTCACT GCGCTATATG CTGGTGGATG 150 

GGCAGGGCAA CTTCGGTTCC GTTGATGGCG ACTCCGCCGC AGCGATGCGT 200 

TATACCGAAA TCCGTATGTC TAAAATTGCT CACGAATTGT TGGCGGACTT 250 

AGAAAAAGAT ACCGTCGACT TCGTGCCGAA CTATGACGGT ACGGAGCAAA 300 

TTCCTGCCGT AATGCCAACC CGAATCCCTA ACTTGCTGGT TAACGGCTCG 350 

10 TCAGGTATTG CTGTCGGTAT GGCAACCAAT ATTCCGCCGC ATAACCTTTC 400 

TGAGGTTATT GATGGCTGTC TGGCCTATAT CGAAGATGAA AACATCACCA 4 50 

TTGAAGGGTT GATGGAGTAC ATCCCGGGGC CAGATTTCCC AACTGCTGCG 500 

ATTATCAATG GTCGCCGTGG TATTGAAGAA GCTTATCGTA CTGGCCGTGG 550 

CAAGGTGTAT ATCCGTGCCC GTGCTGAAGT TGAGGCTGAC GCTTAAACCG 600 

15 GTCGCGAAAC CATTATTGTT CACGAGATCC CGTATCAGGT GAACAAGGCG 650 

CGGTTGATTG AAAAAATCGC CGAGCTGGTT AAAGAATW^C GCGTAGAAGG 700 

CATCAGTGCG TTGCGTGATG AGTCTGATAA AGACGGCATG CGTATCGTGA 750 

TTGAAATCAA ACGTGATGCT GTCGGGGAAG TGGTTCTGAA CAACCTCTAT 800 

TCTCTGACGC AATTGCAGGT GACTTTCGGT ATCAATATGG TGGCTCTGTC 850 

20 TCAAGGGCAG CCTAAGTTGC TTAACCTGAA AGACATTTTG GTTGCTTTCG 900 

TGCGCCACCG CCGTGAAGTG GTGACTCGCC GTACCATTTT 940 



25 2) INFORMATION FOR SEQ ID NO: 1777 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



40. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1777 



CCGTTTATTG GCGATGGCCT G/^GCCGGTC CAGCGTCGCA TCGTCTATGC 50 

GATGTCTGAA CTGGGTCTGA ACGCCAGCGC GAAGTTCAT^A AAGTCCGCCC 100 

GCACCGTCGG TGACGTGCTG GGTAAATACC ATCCCCACGG CGACAGCGCG 150 

TGCTATGAAG CCATGGTGCT GATGGCTCAG CCCTTCTCCT ACCGCTATCC 200 

45 GCTGGTTGAC GGTCAGGGAA ACTGGGGGGC GCCGGACGAT CCTAAATCCT 250 

TCGCCGCAAT GCGTTATACC GAATCCCGTT TGTCGAAGTA TGCTGAACTG 300 

CTGCTGAGCG AACTGGGGCA AGGCACCGTT GACTGGGTAC CAAACTTCGA 350 

CGGCACTTTG CAGGAGCCGA AGATGCTGCC TGCGCGCCTG CCCAATATTC 4 00 

TGCTAAACGG TACTACCGGC ATTGCCGTTG GGATGGCGAC GGACATTCCG 4 50 

50 CCGCACAACC TGCGTGAAGT GGCCCGGGCG GCCATTACCC TGATTGAAAA 500 

GCCGCAAACT TCGCTGGATG ACCTGCTGGA TATCGTGCAG GGGCCGGATT 550 

ATCCTACCGA AGCGGAAATC ATTACCCCCC GTGCCGAAAT CCGCAAAATC 600 

TACCAGAATG GCCGCGGTTC GGTGCGGATG CGCGCGGTAT GGGCCAAAGA 650 

930 
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668 



5 2) INFORMATION FOR SEQ ID NO: 1778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1778 



GTTTATTGGC GATGGCCTGA AGCCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCTGAACT GGGTCTGAAC GCCAGCGCGA AGTTCAAAAA GTCCGCCCGC 100 

ACCGTCGGTG ACGTGCTGGG TAAATACCAT CCCCACGGCG ACAGCGCGTG 150 

CTATGAAGCC ATGGTGCTGA TGGCTCAGCC CTTCTCCTAC CGCTATCCGC 200 

25 TGGTTGACGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC TAAATCCTTC 250 

GCCGCAATGC GTTATACCGA ATCCCGTTTG TCGAAGTATG CTGAACTGCT 300 

GCTGAGCGAA CTGGGGCAAG GCACCGTTGA CTGGGTACCA AACTTCGACG 350 

GCACTTTGCA GGAGCCGAAG ATGCTGCCTG CGCGCCTGCC CAATATTCTG 400 

CTAAACGGTA CTACCGGCAT TGCCGTTGGG ATGGCGACGG ACATTCCGCC 4 50 

30 GCACAACCTG CGTGAAGTGG CCCGGGCGGC CATTACCCTG ATTGAAAAGC 500 

CGCAAACTTC GCTGGATGAC CTGCTGGATA TCGTGCAGGG GCCGGATTAT 550 

CCTACCGAAG CGGAAATCAT TACCCCCCGT GCCGAAATCC GCAAAATCTA 600 

CCAGAATGGC CGCGGTTCGG TGCGGATGCG CGCGGTATGG GCCAAAGAAG 650 

ACGGCGCGGT GGTGATTAGC GCGCTGCCGC ATCAGGTTTC CGGCGCCCGC 700 

35 GTGCTTGAGC AGAT 714 



2) INFORMATION FOR SEQ ID NO: 1779 



40 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 722 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1779 
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TTGGCGATGG CTTAAAACCG GTCCAGCGTC 
GAGCTGGGGC TGAACGCCAG CGCGAAATTC 
CGGCGACGTG TTGGGTAAAT ATCACCCGCA 
5 AAGCGATGGT GCTGATGGCG CAGCCGTTCT 
GATGGTCAGG GAAACTGGGG GGCGCCGGAC 
CATGCGTTAC ACCGAATCCC GCCTGTCGAA 
GCGAGCTGGG GCAGGGGACG GTCGACTGGG 
CTGCAGGAGC CGAAAATGCT GCCAGCGCGT 

10 CGGCACCACC GGCATCGCGG TAGGTATGGC 
ACCTGCGGGA AGTGGCCAAA GCGGCGATTA 
ACCACCCTCG ACGAACTGCT GGATATCGTA 
CGAGGCGGAG ATCATCACCT CGCGGGCGGA 
ACGGGCGCGG CTCAGTGCGC ATGCGCGCGG 

15 GCGGTGGTGA TCAGCGCGCT GCCGCATCAG 
GGAGCAGATT GCGGCGCAGA TG 



GCATCGTCTA TGCGATGTCC 50 

AAAAAGTCCG CCCGCACCGT 100 

CGGCGACAGC GCCTGCTATG 150 

CTTACCGCTA TCCGCTGGTG 200 

GATCCCAAAT CTTTCGCCGC 250 

GTATGCCGAG CTGCTGCTCA 300 

TGCCAAACTT TGACGGTACG 350 

TTGCCGAACA TCCTGCTGAA 4 00 

GACCGATATT CCTCCGCACA 4 50 

CGCTGATTGA GCAGCCGAAA 500 

CAGGGGCCGG ATTTCCCGAC 550 

AATTCGAAAA ATCTACCAGA 600 

TGTGGAGTAA AGAGGACGGC 650 

GTCTCCGGCG CCAAAGTGCT 7 00 

722 



20 2) INFORMATION FOR SEQ ID NO: 1780 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: ■ Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1780 

35 

GTCCAGCGTC GCATCGTTTA TGCGATGTCT GAGCTGGGGC TGAACGCCAG 50 

CGCGAAGTTC AAAAAGTCCG CCCGCACCGT GGGTGATGTG CTGGGTAAAT 100 

ATCACCCGCA CGGCGATAGC GCATGCTATG AAGCTATGGT GCTGATGGCG 150 

CAGCCATTCT CCTACCGCTA CCCGCTGGTG GATGGGCAGG GGAACTGGGG 200 

4 0 GGCACCGGAC GATCCTAAAT CCTTCGCCGC GATGCGTTAT ACCGAATCCC 250 

GTTTGTCGAA GTATGCGGAA CTGCTGCTGG GCGAACTGGG GCAGGGAACC 300 

GTCGACTGGG TGCCGAACTT CGACGGGACG ATGCAGGAGC CGAAAATGCT 350 

GCCTGCGCGT CTGCCGAATA TTCTGCTGAA CGGCACTACC GGCATCGCCG 400 

TCGGTATGGC AACCGATATT CCTCCGCACA ACCTGCGTGA AGTGGCGCAG 450 

45 GCGGCGATTA CCCTGATCGA AAAACCGCAG ACCTCGCTCG ACGAACTGCT 500 

GGATATCGTT CACGGACCCG ACTACCCGAC CGAAGCCGAA ATCATTACTC 550 

CACGCGCGGA GATCCGCAAA ATCTACCAGA ACGGCCGCGG TTCGGTGCGG 600 

ATGCGCGCGG TATGGAAAAA AGAGGACGGC GCGGTCGTGA TTACGGCGTT 650 

GCCGCATCAG GTTTCCGGCG CCCGCGTGCT GGAGCAAATT GC 692 

50 



2) INFORMATION FOR SEQ ID NO: 1781 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1781 

15 

GTTTATTGGC GATGGCTTAA AACCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCCGAGCT GGGGCTGAAC GCCAGCGCGA AATTCAAAAA GTCCGCCCGC 100 

ACCGTCGGCG ACGTGTTGGG TAAATATCAC CCGCACGGCG ACAGCGCCTG 150 

CTATGAAGCG ATGGTGCTGA TGGCGCAGCC GTTCTCTTAC CGCTATCCGC 200 

2 0 TGGTGGATGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC CAAATCTTTC 250 

GCCGCCATGC GTTACACCGA ATCCCGCCTG TCGAAGTATG CCGAGCTGCT 300 

GCTCAGCGAG CTGGGGCAGG GGACGGTCGA CTGGGTGCCA AACTTTGACG 350 

GCACGCTGCA GGAGCCGAAA ATGCTGCCAG CGCGTTTGCC GAACATCCTG 400 

CTGAACGGCA CCACCGGCAT CGCGGTAGGC ATGGCGACCG ATATTCCTCC 450 

25 GCACAACCTG CGGGAAGTGG CCAAAGCGGC GATTACGCTG ATTGAGCAGC 500 

CGAAAACCAC CCTCGACGAA CTGCTGGATA TCGTACAGGG GCCGGATTTC 550 

CCGACCGAGG CGGAGATCAT CACCTCGCGG GCGGAAATTC GCAAAATCTA 600 

CCAAAACGGG CGCGGCTCAG TGCGCATGCG CGCGGTGTGG AGTAAAGAGG 650 

ACGGCGCGGT GGTGATCAGC GCGCTGCCGC ATCAGGTCTC CGGCGCCAAA 7 00 

30 



2) INFORMATION FOR SEQ ID NO: 1782 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 
45 (B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1782 

ATTGGCGATG GCTTAAAACC GGTCCAGCGT CGCATCGTCT ATGCGATGTC 50 

50 CGAGCTGGGG CTGAACGCCA GCGCGAAATT CAAAAAGTCC GCCCGCACCG 100 

TCGGCGACGT GTTGGGTAAA TATCACCCGC ACGGCGACAG CGCCTGCTAT 150 

GAAGCGATGG TGCTGATGGC GCAGCCGTTC TCTTACCGCT ATCCGCTGGT 200 

GGATGGTCAG GGAAACTGGG GGGCGCCGGA CGATCCCAAA TCTTTCGCCG 250 

933 
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CCATGCGTTA CACCGAATCC CGCCTGTCGA 
AGCGAGCTGG GGCAGGGGAC GGTCGACTGG 
GCTGCAGGAG CCGAAAATGC TGCCAGCGCG 
ACGGCACCAC CGGCATCGCG GTAGGCATGG 
5 AACCTGCGGG AAGTGGCCAA AGCGGCGATT 
AACCACCCTC GACGAACTGC TGGATATCGT 
CCGAGGCGGA GATCATCACC TCGCGGGCGG 
AACGGGCGCG GCTCAGTGCG CATGCGCGCG 
CGCGGTGGTG ATCAGTGCGC TGCCGCATCA 
10 TGGAGCAGAT TGCGGCGCAG ATGCGC 



PCT/CAOO/01150 

AGTATGCCGA GCTGCTGCTC 300 

GTGCCAAACT TTGACGGCAC 350 

TCTGCCGAAC ATCCTGCTGA 400 

CGACCGATAT TCCTCCGCAC 450 

ACGCTGATTG AGCAGCCGAA 500 

ACAGGGGCCG GATTTCCCGA 550 

AAATTCGCAA AATCTACCAG 600 

GTGTGGAGTA AAGAGGACGG 650 

GGTCTCTGGC GCCAAAGTGC 700 

726 



2) INFORMATION FOR SEQ ID NO: 1783 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1783 

30 GGCTTAAAAC CGGTCCAGCG TCGCATCGTC TATGCGATGT CCGAGCTGGG 50 

GCTGAACGCC AGCGCGAAAT TCAAAAAGTC CGCCCGCACC GTCGGCGACG 100 

TGTTGGGTAA ATATCACCCG CACGGCGACA GCGCCTGCTA TGAAGCGATG 150 

GTGCTGATGG CGCAGCCGTT CTCTTACCGC TATCCGCTGG TGGATGGTCA 200 

GGGAAACTGG GGGGCGCCGG ACGATCCCAA ATCTTTTGCC GCCATGCGTT 250 

35 ACACCGAATC CCGCCTGTCG AAGTATGCCG AGCTGCTGCT CAGCGAGCTG 300 

GGGCAGGGGA CGGTCGACTG GGTGCCAAAC TTTGACGGCA CGCTGCAGGA 350 

GCCGAAAATG CTGCCAGCGC GTCTGCCGAA CATCCTGCTG AACGGCACCA 400 

CCGGCATCGC GGTAGGCATG GCGACCGATA TTCCTCCGCA CAACCTGCGG 450 

GAAGTGGCCA AAGCGGCGAT TACGCTGATT GAGCAGCCGA AAACCACCCT 500 

40 CGACGAACTG CTGGATATCG TACAGGGGCC GGATTTCCCG ACCGAGGCGG 550 

AGATCATCAC CTCGCGGGCG GAAATTCGCA AAATCTACCA GAACGGGCGC 600 

GGCTCAGTGC GCATGCGCGC GGTGTGGAGT AAAGAGGACG GCGCGGTGGT 650 

GATCAGTGCG CTGCCGCATC AGGTCTCCGG CGCCAAAGTG CTGGAGCAGA 700 

TTGCGG 706 

45 



2) INFORMATION FOR SEQ ID NO: 1784 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

934 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13884 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1784 

TTGGCGATGG CTTAAAACCG GTCCAGCGTC GCATCGTCTA TGCGATGTCC 50 

GAGCTGGGGC TGAACGCCAG CGCGAAATTC AAAAAGTCCG CCCGCACCGT 100 

CGGCGACGTG TTGGGTAAAT ATCACCCGCA CGGCGACAGC GCCTGCTATG 150 

15 AAGCGATGGT GCTGATGGCG CAGCCGTTCT CTTACCGCTA TCCGCTGGTG 200 

GATGGTCAGG GAAACTGGGG GGCGCCGGAC GATCCCAAAT CTTTCGCCGC 250 

CATGCGTTAC ACCGAATCCC GCCTGTCGAA GTATGCCGAG CTGCTGCTCA 300 

GCGAGCTGGG GCAGGGGACG GTCGACTGGG TGCCAAACTT TGACGGCACG 350 

CTGCAGGAGC CGAAAATGCT GCCAGCGCGT CTGCCGAACA TCCTGCTGAA 400 

2 0 CGGCACCACC GGCATCGCGG TAGGCATGGC GACCGATATT CCTCCGCACA 450 

ACCTGCGGGA AGTGGCCAAA GCGGCGATTA CGCTGATTGA GCAGCCGAAA 500 

ACCACCCTCG ACGAACTGCT GGATATCGTA CAGGGGCCGG ATTTCCCGAC 550 

CGAGGCGGAG ATCATCACCT CGCGGGCGGA AATTCGCTVAA ATCTACCAGA 600 

ACGGGCGCGG CTCA 614 

25 



2) INFORMATION FOR SEQ ID NO: 1785 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella terrigena 
40 (B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1785 

GCCTGAAACC GGTCCAGCGG CGCATCGTTT ATGCGATGTC CGAACTGGGT 50 

45 CTGAACGCCA CCGCTAAATT CAAAAAATCC GCGCGCACCG TCGGCGACGT 100 

GCTGGGTAAA TATCACCCGC ACGGCGATAG CGCCTGCTAT GAGGCGATGG 150 

TGCTGATGGC GCAGCCGTTC TCTTACCGCT ACCCGCTGGT GGACGGTCAG 200 

GGCAACTGGG GCGCCCCGGA CGATCCCAAA TCCTTCGCCG CGATGCGTTA 250 

TACCGAATCC CGCCTGTCAA AGTATGCGGA GCTGCTGCTG GGCGAGCTGG 300 

50 GTCAGGGAAC CGTTGACTGG GTACCTAACT TTGACGGTAC GATGCAGGAG 350 

CCGAAAATGC TGCCTGCGCG TTTGCCGAAT ATTCTGCTCA ACGGCACCAC 4 00 

CGGTATCGCC GTGGGGATGG CCACCGATAT TCCGCCGCAC AACCTGCGCG 4 50 

AAGTGGCCT^ AGCGGCCATC ACCCTGATTG AAAAGCCGCA GACCTCGCTC 500 

935 
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GACGAACTGC TGGATATCGT TCACGGGCCG GACTACCCCA CCGAAGCTGA 
AATCATCACC CCGCGCGCCG AGATCCGCAA AATCTATCAG AACGGTCGCG 
GCTCGGTTCG CATGCGTGCG GTGTGGAAAA AAGAGGACGG CGCGGTGGTG 
ATTAGCGCCC TGCCGCAT 



550 
600 
650 
668 



2) INFORMATION FOR SEQ ID NO: 1786 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1786 

CATTACGTTC TAACACTCAA GGACGCGGAA CATTCTCTAT GGTGTTTGAC 50 
CACTATGAAG AAGTACCAAA GTCTGTTTCT GAAGAAATTA TCAAAAAAAA 100 
TAAAGGTGAA TAA 113 



2) INFORMATION FOR SEQ ID NO: 1787 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1787 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 
TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 1788 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1788 

15 GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1789 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 118 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 9 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 

TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 

A/UyyiTAAAG GTGAATAA 118 

40 



2) INFORMATION FOR SEQ ID NO: 1790 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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10 



25 



(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1790 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1791 



(i) SEQUENCE CHARACTERISTICS: 
15- (A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 91 



GCAACGTCAT TACGTTCTAA CACGCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATCA 100 
30 AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1792 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1792 

50 CGTCATTACG TTCTAACACT CAAGGACGCG GAACATTCTC TATGGTGTTT 50 
GACCACTATG AAGAAGTACC AAAGTCTGTT TCTGAAGAAA TTATCAAAAA 100 
AAATAAAGGT GAATAA 116 



938 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



10 



2) INFORMATION FOR SEQ ID NO: 1793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1793 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 

GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 

AAAAAAATAA AGGTGAATAA 120 

20 



2) INFORMATION FOR SEQ ID NO: 1794 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
35 (B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1794 

CACTCAAGGA CGCGGAACAT TCTCTATGGT GTTTGATCAC TATGAAGAAG 50 
40 TACCAAAGTC TGTTTCTGAA GAAATTATCA AAAAAAATAA AGGTGAATAA 100 



45 



2) INFORMATION FOR SEQ ID NO: 1795 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1795 



GCAACGGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCTGAA GAAATTATTA 
AAAAAAATAA AGGTGAATAA 



50 
100 
120 



2) INFORMATION FOR SEQ ID NO: 1796 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 96 

GCGACAGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATTA 100 
AAAAAAATAA AGGCGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 107 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1797 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAA7\ATAA AGGTGAATAA 120 
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2) INFORMATION FOR SEQ ID NO: 1798 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1798 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGAGACGCAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCCATATAAA 100 

CTAAGGAGGA ATTTAGA 117 

20 



2) INFORMATION FOR SEQ ID NO: 1799 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
35 (B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1799 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
40 TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



45 2) INFORMATION FOR SEQ ID NO: 1800 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) . TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1800 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
10 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1801 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1801 

30 TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACTTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1802 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 117 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1802 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

942 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 
CTAAGGAGGA ATTTAGA 



PCT/CAOO/01150 
117 



5 2) INFORMATION FOR SEQ ID NO: 1803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 



20 



25 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1803 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGAGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1804 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM Bacillus cereus 

(B) STRAIN: ATCC 4 9064 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1804 



TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCACTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
45 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1805 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1814 



10 ATGGCTAAAG CTAAATTCGA 
AATCGGCCAC GTTGACCATG 
CAGTACTTGC AAAAGCTGGT 
GATGCTGCTC CAGAAGAAAG 
CGTTGAGTAC GTU^ACTGAAA 

15 GTCACGCTGA CTATGT 



ACGTTCTAAA CCCCATGTTA 
GTAAAACTAC ATTAACTGCT 
GGTGCTGAAG CACGCGGATA 
AGAGCGCGGT ATCACAATCT 
CTCGTCACTA TGCACACGTT 



ACATCGGTAC 50 
GCGATCACTA 100 
CGATCAAATC 150 
CAACTGCACA 200 
GACTGCCCAG 250 

266 



2) INFORMATION FOR SEQ ID NO: 1815 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1815 



35 ATGGCTAAAG CTAAATTCGA 
AATCGGCCAC GTTGACCATG 
CAGTACTTGC AAAAGCTGGT 
GATGCTGCTC CAGAAGAAAG 
CGTTGAGTAC GAAACTGAAA 

40 GTCACGCTGA CTATGTTAA 



ACGTTCTAAA CCCCATGTTA 
GTAAAACTAC ATTAACTGCT 
GGTGCTGAAG CACGCGGATA 
AGAGCGCGGT ATCACAATCT 
CTCGTCACTA TGCACACGTT 



ACATCGGTAC 50 
GCGATCACTA 100 
CGATCAAATC 150 
CAACTGCACA 200 
GACTGCCCAG 250 

269 



2) INFORMATION FOR SEQ ID NO: 1816 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) lyiOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 64 62 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1805 

TTGATTTTTA TCAATTGTTC GAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCGCTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1806 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1806 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1807 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
5 CTAAGGAGGA ATT TAG A 



2) INFOE^MATION FOR SEQ ID NO: 1808 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1808 

25 TTGATTTTTA TCGATTCTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAT^ TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



30 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1809 



(i) SEQUENCE CHARACTERISTICS: 
(A)' LENGTH: 117 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1809 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAG H'^ 



2) INFORMATION FOR SEQ ID NO: 1810 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1810 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

. GACGCTGCTC CAGAAGAAAG AGAGCGCGGA ATCACAATCT CAACTGCACA 200 

2 0 CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



25 2) INFORMATION FOR SEQ ID NO: 1811 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

3 0. (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 107 92 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1811 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

45 CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



50 2) INFORMATION FOR SEQ ID NO: 1812 



(i) SEQUENCE CHARACTERISTICS: 
(A), LENGTH: 270 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1812 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA 270 



2) INFORMATION FOR SEQ ID NO: 1813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1813 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GACGCTGCTC CAGAAGAAAG AGAGCGCGGA ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCATGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1814 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1816 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

10 GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTA 268 



15 



25 



2) INFORMATION FOR SEQ ID NO: 1817 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 278 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1817 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCT^AATC 150 

35 GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



40 



2) INFORMATION FOR SEQ ID NO: 1818 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 268 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 . 
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(xi) SEQUENCE DESCRIPTION: SEQ 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
5 AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA 
GTCACGCTGA CTATGTTA 



ID NO: 1818 

CCCCATGTTA ACATCGGTAC 50 

ATTAACTGCT GCGATCACTA 100 

CACGCGGATA CGATCAAATC 150 

ATCACAATCT CAACTGCACA 200 

TGCACACGTT GACTGCCCAG 250 

268 



2) INFORMATION FOR SEQ ID NO: 1819 

15 (i) SEQUENCE CHAFU\CTERISTICS : 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 
25 (B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1819 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

30 AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAT^GCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 

35 



2) INFORMATION FOR SEQ ID NO: 1820 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoldes 
50 (B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1820 
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ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 
AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGACCAAATC 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 
CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 
GTCACGCTGA CTATGTTAAA AACATGAT 



50 
100 
150 
.200 
250 
278 



2) INFORMATION FOR SEQ ID NO: 1821 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 4 9064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1821 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTA 263 



2) INFORMATION FOR SEQ ID NO: 1822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1822 

CAATCGAAGT ACAACGTTCT CTTCGWGTAT TGGACGGTGC GGTTACTGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TACGGAGTTC CACGTATCGT ATTTGCTAAC AAAATGGACA 150 
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AAATCGGTGC TGACTTCCTT TACTCAGTAA GCACACTTCA CGACCGTCTT 200 

CAAGCAAACG CACACCCAAT CCAATTGCCA ATCGGTGCTG AAGATGACTT 250 

CCGTGGTATC ATCGACTTGA TCAAGATGAA AGCTGAAATC TATACTAACG 300 

ACCTTGGTAC AGATATCCTT GAAGAAGATA TTCCAGCTGA ATACCTTGAC 350 

5 CAAGCTCAAG AATACCGTGA AAAATTGGTT GAAGCAGTCG CTGAAACTGA 400 

TGAAGACTTG ATGATGAAAT ACCTTGAAGG TGAAGAAATC ACTAACGAAG 4 50 

AATTGAAAGC TGCTATCCGT AAAGCAACTA TCAACGTTGA ATTCTTCCCA 500 

GTATTGTGTG GTTCTGCCTT CAAGAACAAG GGTGTTCAAT TGATGCTTGA 550 

TGCGGTTATC GACTACCTTC CAAGCCCACT TGATATCCCA GCGATCAAAG 600 

10 GTATCAACCC AGATWCAGAT GAAGAAGAAA CTCGTCCAGC ATCTGACGAA 650 

GAGCCATTCG CAGCTCTTGC CTTCAAGATC ATGACGGACC CATTTGTAGG 700 

TCGTTTGACA TTCTTCCGTG TATACTCARG TGTTCTCCAA TCARGKTCTT 750 

ACGTATTGAA CACATCTAAA GGTAAACGTG AACGTATCGG ACGTATCCTT 800 

CAAATGCACG CTAACAGCCG TCAAGAAATT GACACTGTTT ACTCAGGTGA 850 

15 TATCGCTGCT GCCGTTGGTT TGAAAGATAC TYCAACTGGT GACTCATTGM 900 

CAGATGAAAA AGCTAAAATC ATCCTTGAGT CAATCAACGT TCCAGAMCCA 950 

GTTATCCAAT TGATGGTTGA GCCAAAATCT AAAGCTGACC AAGATAAGAT 1000 

GGGTATCGCC CTTCAAAAAT TGGCTGAAGA AGATCCAACA TTCCGCGTTG 1050 

AAMCAAACGT TGAAACTGGT GAAMCAGTTA TCTCAGGTAT GGGTGAGCTT 1100 

2 0 CACTTGGACG TCCTTGTTGA CCGTATGCGT CGTGAGTTCA AAGTGGAAGC 1150 

GAACGTAGGT GCTCCTCAAG TATCTTACCG TGAAACATTC CGCGCTTCTA 1200 

CTCAAGCACG TGGATTCTTC AAACGTCAGT CTGGTGGTAA AGGTCAATTC 1250 

GGTGATGTAT GGATTGAATT TACTCCAAAC GAAGAAGGTA AAGGATTCGA 1300 

ATTCGAAAAC GCAATCGTCG GTGGTGTGGT TCCTCGTGAA TTTATCCCAG 1350 

25 CGGTTGAAAA AGGTTTGGTA GAATCTATGG CTAACGGTGT TCTTGCAGGT 1400 

TACCCAATGG TTGACGTTAA AGCTAAGCTT TACGATGGTT CATACCACGA 14 50 

TGTCGACTCA TCTGAAACTG CCTTCAAGAT CGCGGCTTCA CTTGCCCTTA 1500 

AAGAAGCTGC TAAGTCAGCA CAACCAGCTA TCCTTGAGCC AATGATGCTT 1550 

GTAACCATCA CTGTTCCAGA AGAAAACCTT GGTGATGTTA TGGGTCACGT 1600 

30 AACTGCTCGT CGTGGACGTG TAGATGGTAT GGAAGCACWC GGTAACAGCC 1650 

AAATCGTTCG TGCTTACG 1668 



35 2) INFORMATION FOR SEQ ID NO: 1823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicla a qua t lea 

(B) STRAIN: ATCC 35567 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1823 

AGACCTGCGT TCACAAACAC AGGGTCGTGC TTCTTACTCT ATGGAGTTCT 50 
TGAAGTACAA CGAAGCGCCA AACAACGTTG CTACAGCAAT CATTGAAGCT 100 
CGTAAGGCTA GATAA 115 

952 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxeiia agrestis 

(B) STRAIN: ATCC 33320 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1824 

CTGCGTTCAC TGACCAAGGT CGTGCATCTT ACTCCATGGA ATTCCTGAAG 50 

TATGATGACG CGCCAAACAA CGTAGCTCAG GCCGTAATCG AAGCTCGCGG 100 

TAAATAA 107 

20 



10 



2) INFORMATION FOR SEQ ID NO: 1825 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
35 (B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1825 

TTACTCCATG GAGTTCCTGA AGTATGATGA TGCGCCGAAC AACGTTGCTC 50 
40 AGGCCGTAAT CGAAGCCCGT GGTAAATAA 79 



45 



2) INFORMATION FOR SEQ ID NO: 1826 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1826 



CAGCTGCGTT CTCTGACCAA AGGTCGTGCA TCATACACTA TGGAATTCCT 
GAAGTATGAT GATGCGCCAA ACAACGTTGC TCAGGCCGTT ATTGAAGCCC 
GTGGTAAGTA A 



50 
100 
111 



2) INFORMATION FOR SEQ ID NO: 1827 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1827 

GATTTGCGCT CTGCAACTCA TGGGCGTGCT TCGTACTCCA TGGAGTTCTT 50 
GAAGTACTCT GATGCACCGC AAAACATTGC GAAAGCGATT ATTGAATCTC 100 
GTAGCTAA 108 



2) INFORMATION FOR SEQ ID NO: 1828 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: QbesuTiibacteriuin proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1828 

CTCAGCTGCG TTCTCTGACC AAAGGTCGTG CATCTTACTC CATGGAATTC 50 
CTGAAGTATG ATGATGCGCC TAACAACGTT GCTCAGGCCG TTATTGAAGC 100 
TCGTGGCAAA TAA 113 
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2) INFORMATION FOR SEQ ID NO: 1829 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1829 

GCCGCAGGGT TAAAACCAAA GTCCCGTGCT CTCTCCTGAA GGGGAGAGCA 50 
CTATAGTAAG GAATATAGCC 70 



10 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 1830 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1830 

GCCTCGGGTA AAACTTATAT CCCAGTCCCC CTCGTATAGA GGGGGATAGA 50 
GTAAAGGAAG ATAATC 66 



40 



2) INFORMATION FOR SEQ ID NO: 1831 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Plesiomonas shigelloidBS 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1831 

5 

TCCACAGGAT TAAAACCCAG GTTTAAACCT AAGTCCCGTG CTCTCTCCTC 50 
AGGGGAGAGC ACAATAGTAA GGAATATAGC C 81 



10 



20 



30 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1832 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 70 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1832 

GCTACTAGTT TAAAACATTG ATCCCGTGCT CTCTCTATGA AGGGAGAGCA 50 
CAAGAGTAAG GAATAAAGCC 70 



2) INFORMATION FOR SEQ ID NO: 1833 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 72 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1833 



TTTCCAGTTA CGACATAAAT GTTATTATGG TCCAGCTTTG ACTGGACTAT 50 
TCTGAAAAGA AAGGAATATA TC 72 



2) INFORMATION FOR SEQ ID NO: 1834 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1834 

GCCCCGGGTT TTAAAAAACA TTGATCCCGT GCTCTCTCCA GAAGGGGAGA 50 
GCGCAACAGT AAGGAATATA GCC 73 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1835 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 95 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CampyJojbacter coli 

(B) STRAIN: ATCC 43479 

35 (xi) SEQUENCE DESCRIPTION: SEQ IDvNO: 1835 

CTGCAGCTGA TGGTCCTATG CCACAAACTA GAGAGCACAT CCTTCTATCA 50 

CGCCAAGTAG GTGTTCCATA TATCGTTGTA TTTATGAATA AAGCAGATAT 100 

GGTTGATGAT GCTGAACTTT TAGAATTGGT TGAAATGGAA ATTAGAGAAT 150 

40 TATTAAGCTC TTATGATTTC CCAGGTGATG ACACACCTAT TATTTCAGGT 200 

TCTGCTTTAA AAGCTCTTGA AGAAGCAAAA GCTGGACAAG ATGGCGAATG 250 

GTCAGCGAAA ATTATGGATC TTATGGCTGC TGTTGATAGC TATATTCCAA 300 
CTCCAACTCG TGACACTGAA AAAGATTTCT TAATGCCAAT TGAAGATGTT - 350 

TTCTCAATTT CAGGTCGTGG TACTGTTGTT ACAGGTAGAA TTGAAA7UVGG 400 

45 TATTGTAAAA GTTGGTGATA CTATAGAAAT CGTTGGTATT AAAGATACTC 450 

AAACAACAAC TGTAACTGGC GTTGAAATGT TTAGAAAAGA AATGGACCAA 500 

GGTGAAGCAG GGGATAATGT TGGTGTTCTT CTTCGTGGTA CAAAAAAAGA 550 

AGAAGTTATC CGCGGTATGG TTCTTGCTAA ACCAAAATCA ATTACTCCAC 600 

ATACTGATTT CGAAGCTGAA GTTTATATCC TAAATA/^GA TGAGGGTGGT 650 

50 AGACATACTC CATTCTTTAA TAACTATAGA CCGCAATTCT ATGTAAGAAC 700 

/y^CAGATGTA ACAGGTTCTA TTAAATTAGC TGATGGCGTT GAAATGGTTA 750 

TGCCTGGTGA AAATGTAAGA ATTACTGTAA GCTTGATTGC ACCAG 7 95 
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2) INFORMATION FOR SEQ ID NO: 1836 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1836 

GCCATACTAG TTGTTTCTGC AGCTGATGGC CCAATGCCAC AAACTAGAGA 50 

GCACATTTTG CTATCTCGTC AAGTTGGTGT TCCATATATA GTTGTTTTTA 100 

TGAACAAAGC TGATATGGTA GATGACGCAG AGTTGCTAGA ATTAGTTGT^ 150 

20 ATGGAGATCA GAGAGTTATT AAGCGAATAT GACTTCCCTG GTGATGATAC 200 

TCCTATTATA AGCGGATCAG CACTTCAAGC tCTTGAAGAA GCTAAAGCTG 250 

GTAATGATGG CGAATGGTCA GCTAAGATTA TGGATCTTAT GGCTGCTGTT 300 

GATAGCTACA TACCAACTCC AGTTCGTGCT ACTGATAAAG ATTTCTTAAT 350 

GCCGATTGAA GACGTATTCT CAATTTCTGG CCGTGGTACT GTTGTTACTG 400 

25 GTAGAATTGA AAAAGGTATA GTTAAAGTTG GTGATACTAT CGAAATCGTA 450 

GGTATTAGAG ATACACAAAC TACAACAGTT ACCGGCGTTG AAATGTTTAG 500 

AAAAGAAATG GATCAAGGCG AGGCTGGTGA TT^CGTTGGT GTTCTTTTAC 550 

GCGGTACAAA GAAAGAAGAC GTTGAAAGAG GTATGGTTCT TTGTAAGCCA 600 

AAATCAATTA CTCCTCATAC TAAATTTGAG GGAGAAGTTT ATATCTTGAC 650 

30 TAAGGAAGAG GGCGGTAGAC ATACTCCATT CTTCAACAAC TATAGACCAC 700 

AATTTTATGT AAGAACAACA GATGTTACTG GATCAATCAC TCTTCCAGAG 750 

GGTACTGAGA TGGTTATGCC TGGTGATAAC TTAAAAATCA CTGTTGAGTT 800 

AATCAACCCA GTTGCTC 817 

35 

2) INFORMATION FOR SEQ ID NO: 1837 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1837 

CCATACTAGT TGTTTCTGCA GCTGATGGCC CAATGCCACA AACTAGAGAG 50 
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CACATTTTGC TATCTCGTCA AGTTGGTGTT CCATATATAG TTGTTTTTAT 100 

GAACAAAGCT GATATGGTAG ATGACGCAGA GTTGCTAGAA TTAGTTGAAA 150 

TGGAGATCAG AGAGTTATTA AGCGAATATG ACTTCCCTGG TGATGATACT 200 

CCTATTATAA GCGGATCAGC ACTTCAAGCT CTTGAAGAAG CTAAAGCTGG 250 

5 TAATGATGGC GAATGGTCAG CTAAGATTAT GGATCTTATG GCTGCTGTTG 300 

ATAGCTACAT ACCAACTCCA GTTCGTGCTA CTGATAAAGA TTTCTTAATG 350 

CCGATTGAAG ACGTATTCTC GATTTCTGGC CGTGGTACTG TTGTTACTGG 400 

TAGAATTGAA AAAGGTATAG TTAAAGTTGG TGATACTATC GAAATCGTAG 450 

GTATTAGAGA TACACAAACT ACAACAGTTA CCGGCGTTGA AATGTTTAGA 500 

10 AAAGAAATGG ATCAAGGCGA GGCTGGTGAT AACGTTGGTG TTCTTTTACG 550 

CGGTACA/y^G AAAGAAGACG TTGAAAGAGG TATGGTTCTT TGTT^GCCAA 600 

AATCAATTAC TCCTCATACT AAATTTGAGG GAGAAGTTTA TATCTTGACT 650 

AAGGAAGAGG GCGGTAGACA TACTCCATTC TTCAACAACT ATAGACCACA 7 00 

ATTTTATGTA AGAACAACAG ATGTTACTGG ATCAATCACT CTTCCAGAGG 750 

15 GTACTGAGAT GGTTATGCCT GGTGATAACT TAAAAATCAC TGTTAGTT 798 



2) INFOEIMATION FOR SEQ ID NO: 1838 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1838 

35 GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACTGCA GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGCGGTTCTG CACGCGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

40 GGCACGCCGA CTACGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCGATGC CACAGACTCG 350 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATTC ATGATCGTGT 400 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 450 

GAAATGGAAG TTCGTGAACT TCTGTCTGCT TATGATTTCC CGGGCGACGA 500 

45 CATCCCAGTG GTTCGTGGTT CAGCTCTGAA AGCGCTGGAA GGCGAAGCAG 550 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG GTCACCTGGA TAACTACATC 600 

CCAGAACCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGT^^GA 650 

CGTATTCTCC ATCTCCGGCC GTGGTACTGT TGTTACTGGT CGTGTAGAGC 700 

GTGGTATCAT TAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAT 750 

50 ACCGTGAAAT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA T^CTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATTAAAC 850 

GTGAAGATAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCTATCAAG 900 

CCGCACACTC AGTTCGAATC AGAAGTTTAT ATCCTGTCCA AAGATGAAGG 950 
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CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA 
GTACAACTGA CGTGACTGGC ACCATCGAAC 
GTAATGCCGG GCGAC7\ACAT TCAAATGGTT 
CGCAATGGAC GACGGT 
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CCGTCCACAG TTCTACTTCC 1000 
TGCCAGAAGG CGTTGAGATG 1050 
GTTACCCTGA TCCACCCAAT 1100 

1116 



2) INFORMATION FOR SEQ ID NO: 1839 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109. bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
20 (B) STRAIN: ATCC 13182 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1839 



GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

25 TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACCGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTGCTG CTCGCGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

30 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 450 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 500 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCTCTGGAA GGCGACGCTG 550 

35 AGTGGGAATC TAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 750 

ACTGCTAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

4 0 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 900 

CCGCACACCA AGTTCGAATC TGAAGTTTAT ATCCTGTCCA AAGACGAAGG 950 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

45 GTTATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGA 1109 



50 2) INFORMATION FOR SEQ ID NO: 1840 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1108 bases 



960 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1840 



GTGTCTAAAG AAAAATTTGA ACGTACTAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCA GCTATCACTA 100 

15 CCGTACTGTC TAAAGTATAC GGTGGTCAGG CTCGTGCATT CGATCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTAGAGTAC GACACCCCAA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

GTCACGCCGA CTACGTGAAG AACATGATCA CTGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTAGTAGC TGCGACTGAC GGCCCAATGC CTCAGACTCG 350 

2 0 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 400 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 4 50 

GAAATGGAAG TACGTGAGCT GCTGTCTCAG TACGACTTCC CAGGCGACGA 500 

TACTCCAGTT GTTCGCGGTT CTGCACTGAA AGCGCTGGAA GGCGATGCTC 550 

AGTGGGAAGA GAAGATTGTT GAACTGGCAG GCTACCTGGA CAGCTACATC 600 

25 CCTGAGCCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATCTCCGGCC GTGGTACTGT AGTAACTGGT CGTGTAGAGC 7 00 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 750 

ACTACCAAGA CTACTTGTAC TGGCGTTGAA ATGTTCCGTA AGCTGCTGGA 800 

CGAAGGTCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 8 50 

30 GTGATGACGT AGAGCGTGGT CAGGTTCTGG CTAAGCCAGG CTCAATCAAC 900 

CCACACACCA ACTTTGTAGC AGAAGTTTAT ATTCTGTCCA AAGATGAAGG 950 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

GTAATGCCAG GTGACAACAT TCAAATGGTT GTTACCCTGA TTGCACCAAT 1100 

35 CGCGATGG 1108 



2) INFOE^yiATION FOR SEQ ID NO: 1841 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1841 

961 
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TGGCAA?^GC TAAATTTGAA CGTATTAAGC CTCACGTAAA CGTGGGCACC 50 

ATTGGTCACG TTGACCATGG TAAAACCACT CTGACTGCAG CTATCTCTCA 100 

CGTACTGGCT AAGACCTACG GTGGCGAAGC TAAAGACTTC TCTCAAATCG 150 

5 ATAACGCTCC AGAAGAGCGT GAGCGCGGTA TTACCATCAA TACCTCTCAC 200 

ATCGAATATG ACACGCCATC ACGCCACTAC GCCCACGTAG ACTGCCCAGG 250 

CCACGCTGAC TATGTTAAAA ACATGATCAC TGGTGCTGCA CAGATGGACG 300 

GCGCGATTCT GGTAGTCGCT TCAACAGACG GTCCAATGCC ACAGACTCGT 350 

GAGCACATCC TGCTTTCTCG TCAGGTTGGC GTACCATTCA TCATCGTATT 4 00 

10 CATGAACAAA TGTGACATGG TAGATGACGA AGAGCTGTTA GAGCTAGTTG 4 50 

AGATGGAAGT GCGTGAACTG TTATCAGAAT ACGATTTCCC AGGTGATGAC 500 

TTACCGGTAA TCCAAGGTTC AGCTCTGAAA GCGCTAGAAG GCGAGCCAGA 550 

GTGGGAAGCA AAAATCCTTG AATTAGCAGC GGCGCTGGAT TCTTACATTC 600 

CAGAACCACA ACGTGACATC GATAAGCCGT TCCTACTGCC AATCGAAGAC 650 

15 GTATTCTCAA TTTCAGGCCG TGGTACAGTA GTAACAGGTC GTGTTGAGCG 700 

TGGTATTGTA CGCGTAGGCG ACGAAGTTGA AATCGTTGGT GTACGTGCGA 750 

CAACTAAGAC AACGTGTACT GGTGTAGAAA TGTTCCGTAA ACTGCTTGAC 800 

GAAGGTCGTG CAGGTGAGAA CTGTGGTATT TTGTTACGTG GTACTAAGCG 850 

TGATGACGTA GAACGTGGTC TU^GTATTAGC GAAGCCAGGT TCAATCAACC 900 

2 0 CACACACTAC TTTTGAATCA GAAGTTTACG TACTGTCAAA AGAAGAAGGT 950 

GGTCGTCACA CGCCATTCTT CAAAGGCTAC CGTCCACAGT TCTACTTCCG 1000 

TACAACTGAC GTAACCGGTA CTATCGAACT GCCAGAAGGC GTAGAGATGG 1050 

TAATGCCAGG CGATAACATC AAGATGGTAG TGACACTGAT TTGCCCAATC 1100 

GCGATGG 1107 

25 



2)INF0I^TI0N FOR SEQ ID NO: 1842 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 
. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 
40 (B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1842 

GTGTCTAAAG AAAAATTTGA ACGTACAAT^ CCGCACGTTA ACGTTGGTAC 50 

45 AATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CACGTGCATT CGACCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

50 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTAT 4 00 

TCCTGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAGCTGGTA 4 50 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CAGGCAATGA 500 

962 
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TACTCCAATC ATCCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCGTA GAACTGGCTG AAACTCTGGA TTCTTACATC 600 

CCAGAACCAG AACGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTTACCGGT CGTGTAGAGC 700 

5 GCGGTATCGT TAAAGTTGGT GAAGAAGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTT/W^T CAACTTGTAC CGGCGTTGAA ATGTTCCGTA AACTGCTGGA 800 

CGAAGGTCGT GCAGGCGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 850 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCTATCAAA 900 

CCACACACCA AGTTCGAATC AGAAGTTTAT ATTCTGAGCA T^GATGAAGG 950 

10 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAAATGATC GTTACCCTGA TCCACCCAAT 1100 

CGCAATGGAC GATGGT 1116 

15 

2) INFORMATION FOR SEQ ID NO: 1843 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1129 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1843 



GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACAAC TCTGACTGCT GCTATCACCA 100 

35 GCGTTTTAGC TAAAACTTAT GGCGGTAACG CTCGTGCATT CGATCAAATC 150 

GATAATGCAC CAGAAGAAAA AGCACGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAGTAT GATACTCCTG CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GACACGCCGA CTATGTGAAA AACATGATCA CCGGTGCTGC TCAAATGGAC 300 

GGCGCGATCT TAGTTGTTGC GGCAACTGAT GGTCCTATGC CACAGACTCG 350 

40 TGAGCACATC CTGTTAGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTGT 400 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAACTGTT AGAATTAGTT 4 50 

GAAATGGAAG TTCGTGAGCT TCTTTCTGCT TATGATTTCC CTGGTGACGA 500 

TACTCCAGTT GTTCGTGGTT CTGCGCTGAA AGCGTTAGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCATT GAATTAGCCG GATATCTGGA TAGCTACATC 600 

45 CCAGAGCCAG AGCGTGCGAT TGACCGTCCG TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATTTCAGGCC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 700 

GCGGAATCGT TAAAGTCGGT GAAGCCGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTACAAA CTACCTGTAC TGGCGTTGAA ATGTTCCGTA AGTTACTTGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 850 

50 GTGAAGAAAT CGAACGCGGT CAAGTACTGG CTAAGCCAGG TTCAATCAAC 900 

CCGCACACCA ACTTTGTATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

963 
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GTGATGCCTG GTGACAACAT TCAGATGACT GTAACTCTGA TTGCACCAAT J. 100 
CGCGATGGAC GAAGGTTTAC GCTTCGCTA 1129 



2) INFORMATION FOR SEQ ID NO: 1844 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1844 

ATATTCATAA TGCATTACAA GTTGTGAAAA CAACAAGTGA CGGAAGTGAA 50 

AAGACTGTTA CATTGGAAAC TGCTGTAGAA TTAGGGGATG GTGCAGTTCG 100 

TACGATTGCC ATGGAATCTA CAGATGGTTT GCAACGTGGC ATGAAAGTAG 150 

25 TGGACTTAGG ACGCACAATT AGCGTTCCTG TGGGACCTGA AACATTAGGT 2 00 

CGTGTATTCA ACGTTTTAGG AGATACAATC GACTTGAAAG AACCATTCCC 250 

AGAAGACTTT ACAAGACATG AAATCCATAA ACCAGCACCA AAATTTGAAG 300 

AATTAAACAG TCAATATGAA ATTCTACAAA CAGGGATTAA AGTTATTGAC 350 

CTTTTAGCAC CTTATCTTAA AGGTGGTAAA ATCGGTTTAT TCGGTGGTGC 4 00 

30 CGGTGTAGGG AAAACCGTAT TAATTCAAGA ATTAATTCAT AATATCGCTG 450 

AAGAACTTGG TGGTATTTCA GTATTTACAG GGGTAGGGGA ACGTACTCGT 500 

GAAGGGAATG ACCTTTACCA TGAAATGCAA GAATCAGGCG TATCTGCTAA 550 

AACAGCGATG GTGTTTGGGC AAATGAACGA ACCACCAGGA GCTCGTATGC 600 

GTGTAGCACT AACAGGGTTA ACTATTGCGG AATACTTCCG TGATATGGAA 650 

35 AAACAAGACG TGCTTTTATT CATCGATAAC ATTTATCGTT TCACGCAAGC 700 

AGGTTCAGAA GTGTCAGCGT TACTTGGTCG TATGCCTTCT GCCGTAGGGT 750 

ATCAACCAAC ATTAGCGACA GAAATGGGTC AATTACT^GA ACGTATCAGT 800 

TCAACTAAAG 810 



40 



2) INFORMATION FOR SEQ ID NO: 1845 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 
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(B) STRAIN: ATCC 9345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 5 

5 TGCTCTGCCA GAAATTAACA ACGCCCTCCT CACCGAAGTA GACCTCTCCG 50 

GCCAAGGCGA AGGCGAAAGC GTTCTCAAGA TGACTCTTGA GGTTGCTCAG 100 

CACCTCGGCG ATAACATCGT CCGTACCATC GCCATGAAGC CAACCGACGG 150 

TCTGGTTCGC GGCGCCACCG TTATCGATAC CGGCGCCCCA ATCACCGTGC 200 

CAGTTGGCGA CGCAACTAAA GGTCATGTTT TCAACGTGAC CGGTGATGTC 250 

10 CTAAACTTGG GCGAAGGCGA AACCCTTGAC GTCAAGGAAC GGTGGCCAAT 300 

CCACCGCAAG GCTCCACAGT TCGACGAACT CGAACCGGAA ACCAAGATGT 350 

TCGAAACAGG CATCAAGGTG ATCGATCTCC TCACCCCATA CGTACAGGGC 4 00 

GGCAAGATCG GTCTGTTTGG CGGTGCTGGT GTTGGTAAGA CCGTTCTTAT 4 50 

CCAGGAAATG ATCCAGCGTG TTGCACAGGA TCATGGCGGT GTGTCCGTGT 500 

15 TCGCGGGTGT GGGTGAACGT ACCCGTGAAG GTAACGATCT TATCCACGAA 550 

ATGGAAGhTG CGGGCGTTCT TGATAAGACC GCGCTTGTGT TCGGCCAGAT 600 

GGATGAACCG CCAGGGGTTC GTTTGCGTAT TGCACTTTCC GGCCTGACCA 650 

TGGCGGAATA CTTCCGTGAC GTGCAAAACC AGGACGTGCT TTTGTTCATC 700 

GATAACATCT TCCGCTTCAC CCAGGCAGGT TCGGAAGTGT CCACGTTGCT 750 

20 TGGCCGTATG CCATCAGCAG TGGGCTACCA GCCGACCTTG GCAGATKAAA 800 

TGGGCGCATT GCAGG 815 



25 2) INFORMATION FOR SEQ ID NO: 1846 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24670 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1846 



ACCTTCCTCC TATCTTGAAH GCCCTGGAGG TCCAAAACCA CAGCTCTCGY 50 

TTAGTTTTAG AGGTGTCCCA GCATTTGGGT GAAAACACCG TTCGTACTAT 100 

TGCTATGGAC GGTACTGAAG GATTGGTTCG TGGTCAAAAT GTCGTAGATA 150 

CCGGATATCC TATTAGAGTT CCTGTCGGTC CTGAATGTTT GGGTCGTATC 200 

45 ATGAACGTTA TTGGCGAGCC TGTTGATGAG CGCGGYCCTA TCAAGACCAA 250 

GAAGCTTGCA CCCATCCACG CTTCTCCCCC CGAGTTCGTY GACCAATCCA 300 

CCACCCCCGA AATCTTGGAG ACTGGTATTA AGGTTGTCGA TTTGTTGGCC 350 

CCTTACGCTC GTGGTGGTAA GATCGGTCTT TTCGGTGGTG CCGGTGTCGG 400 

TAAGACTGTG TTTATCCAGG AGTTGATYAA CAACGTTGCC AAGGCCCACG 4 50 

50 GTGGTTACTC CGTGTTCGCT GGTGTTGGTG AGCGTACTCG TGAGGGTAAC 500 

GATTTGTACC ACGAGATGAT TCAAACTGGT GTCATCAAGC TTGATGGCCA 550 

ATCCAAGGCT GCCCTTGTCT ACGGMCAAAT GAACGAGCCC CCAGGTGCTC 600 

GTGCCCGMGT CGCTTTGACC GGTCTTACCG TTGCTGAATA CTTCCGTGAT 650 

965 
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GAGGAAGGHC AAGATGTGTT 
CCAAGCTGGT TCTGAAGTGT 
TCGGTTACCA ACCCACCTTG 
ATTACCACCA CCAAGAAGGG 
5 CCCTGCTGAT GATTTGACCG 
TTGACGCCAC CACCGTGTTG 
CCCGCTGTCG ATCCCCTCGA 
TGTCGGTGAA GAGCACTACG 
ARTCTTACAA GTCTCTCCAG 



GCTCTTCATT GACAACATTT 
CYGCCTTGTT GGGTCGTATC 
GCCACCGATA TGGGTGTCAT 
TTCCATTACC TCTGTCCAGG 
ATCCCGCTCC TGCCACTACT 
TCTCGTTCCA TCTCTGAGTT 
CTCCAAGTCT CGTATGTTGG 
ACATCGCCAC TGGTGTTCAG 
GAT 



PCT/CAOO/01150 

TCCGTTTCAC 700 

CCCTCCGCTG 750 

GCAAGAGCGT 800 

CCATTTACGT 850 

TTTGCCCATC 900 

GGGTATTTAC 950 

ATCCYCGTAT 1000 

AAGATTCTCC 1050 
1073 



2) INFORMATION FOR SEQ ID NO: 1847 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 
25 (B) STRAIN: ATCC 56220 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1847 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TGCCTCACCC TTTGGGTATT 50 

30 TTGCGAATAC TAATTATAGT AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAACGTA CTCGTGAGGG TAACGATTTG 150 

TACCACGAAA TGCAGGAAAC TGGTGTCATT CAGCTCGAGG GTGAATCCAA 200 

GGTCGCCCTC GTGTTCGGTC AGATGAACGA GCCCCCTGGT GCCCGTGCCC 250 

GTGTCGCTCT TACTGGTTTG ACCATTGCCG AGTACTTCCG TGACGAGGAG 300 

35 GGTCAAGATG TGCTTCTCTT CATTGACAAC ATTTTCCGTT TCACTCAGGC 350 

CGGTTCTGAG GTGTCTGCCC TTTTGGGTCG TATCCCCTCT GCCGTCGGTT 400 

ACCAGCCCAC TCTCGCCGTC GACATGGGTG TCATGCAGGA GCGTATTACC 450 

ACCACCACCA AGGGTTCCAT CACCTCCGTC 480 



2) INFORMATION FOR SEQ ID NO: 1848 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 566 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

966 
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(B) STRAIN: ATCC 14112 

(xi) SEQUENCE DESCRIPTION: SEQ 

5 TGTCTTCATT CAGGAGTTGA TTGTACGTCC 

GGCGAGGAAA ATTTTTGGCT TTTTCTAATA 

GCCAAAGCCC ACGGTGGTTA CTCCGTTTTC 

CCGTGAAGGA AACGATTTGT ACCACGAGAT 

AGCTCGATGG CGAGTCTAAG GTCGCACTCG 

10 CCCCCCGGAG CCCGTGCCCG TGTTGCCCTC 

ATATTTCCGT GACGAGGAAG GTCAAGACGG 

TCCGGGCAAA TTGACTCAGA ACCGCACTCA 

TCTCTTTATC GACAACATTT TCCGCTTCAC 

CCGCCCTGCT TGGTCGTATT CCCTCCGCCG 

15 GCCGTCGACA TGGGTGGTAT GCAGGAACGT 
CTCCATTACC TYCGTG 



ID NO: 1848 

CTTCCTCTCT ACAAATGACG 50 

GCTCGTTATA G7\ACAACATT 100 

ACTGGTGTCG GCGAGCGGAC 150 

GCAGGAAACC CGTGTTATCC 200 

TCTTCGGTCA GATGAACGAG 250 

ACTGGCCTGA CCATTGCTGA 300 

TATGTATTCA TATAAATTAC 350 

CTCACACATA TATTAGTGCT 400 

CCAGGCCGGT TCCGAAGTGT 450 

TCGGTTACCA ACCCACTCTC 500 

ATCACAACCA CCACCAAGGG 550 

566 



20 2) INFORMATION FOR SEQ ID NO: 1849 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campy lohacter coli 

(B) STRAIN: ATCC 4 3479 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1849 



AATGAAGCCA TTGTTGTAAA TTTTGAAAGT GAAGGCAAAA AACAAAAACT 50 

TGTTTTAGAA GTAGCAGCAC ACTTGGGCGA TAATAGAGTT AGAACTATTG 100 

CTATGGATAT GACAGATGGC TTGGTAAGAG GACTTAAAGC AGAAGCTTTG 150 

GGTGCTCCTA TTAGCGTTCC TGTGGGTGAA AAAGTTTTAG GAAGAATTTT 200 

40 TAATGTTACG GGAGATTTGA TCGATGAAGG TGAAGAAATT TCTTTTGATA 250 

AAAAATGGGC AATTCATAGA GATCCACCAG CTTTTGAAGA TCAAAGCACA 300 

AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTGGATT TACTTGCTCC 350 

TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA GGTGTTGGTA 4 00 

AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT TAAACATAGC 4 50 

45 GGCTATTCTG TATTTGCAGG TGTAGGTGAG AGAACTCGTG AAGGAAATGA 500 

CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA GTTGCTCTAT 550 

GTTATGGACA AATGAATG/^ CCACCAGGGG CAAGT^TCG TATTGCTTTA 600 

ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG GTCTTGATGT 650 

GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA GGTTCTGAAA 700 

50 TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA TCAACCAACC 750 

CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT CAACTAAAAA 800 

AGGATCAATT ACTTCAG 817 



967 
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10 



35 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) -ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1850 

AAGGCAACAC GCATAAACTT ATTTTAGAGA CTGCTGCACA CCTTGGAGAT 50 

AATCGTGTAA GAACTATCGC TATGGATATG AGCGAAGGAC TTACAAGAGG 100 

GTTAGATGCT ATAGCGCTTG GGTCGCCTAT CAGTGTTCCT GTTGGAGAAA 150 

20 AAGTTTTAGG AAGAATATTC AACGTAATTG GTGATCTTAT AGACGAAGGC 200 

GAAGAAGAAA AATTTGATAA AAAATGGTCG ATTCATAGAG ATCCGCCGGC 250 

ATTTGAAGAT CAAAGCACAA AAAGTGAAAT TTTTGAAACA GGTATAAAAG 300 

TCGTAGATCT TTTGGCTCCT TATGCAAAAG GCGGTAAAGT TGGACTATTT 350 

GGCGGTGCCG GCGTTGGTAA AACAGTTATC ATTATGGAAC TTATCCACAA 400 

25 CGTTGCATTC AAACACAGCG GCTATTCGGT ATTTGCCGGT GTCGGTGAAA 4 50 

GAACAAGAGA GGGTAACGAT CTTTATAATG AAATGAAAGA ATCCGGCGTT 500 

TTGGATAAAG TTGCCTTATG TTATGGACAA ATGAATGAAC CGCCGGGTGC 550 

AAGAAACCGT ATAGCGCTTA CTGGTCTTAC AATGGCTGAG TATTTTCGTG 600 

ACGAGATGGG ACTAGATGTT CTTATGTTTA TCGATAACAT CTTCCGTTTC 650 

3 0 TCACAATCAG GCTCAGAGAT GTCGGCTCTT CTTGGACGTA TCCCAAGTGC 700 

GGTTGGTTAT CAACC7\ACGT TAGCTAGCGA AATGGGAAGA CTTCTU^GAAA 750 

GAATCACATC AACTAAAAAA GGTTC 775 



2) INFORMATION FOR SEQ ID NO: 1851 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 793 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1851 

CGAAGCTATT GAAGTAAATT TTACAGTAGA AGGCAACACG CATAAACTTA 50 
TTTTAGAGAC TGCTGCACAC CTTGGAGATA ATCGTGTAAG AACTATCGCT 100 

968 
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ATGGATATGA GCGAAGGACT TACAAGAGGG TTAGATGCTA TAGCGCTTGG 150 

GTCGCCTATC AGTGTTCCTG TTGGAGAT^AA AGTTTTAGGA AGAATATTCA 200 

ACGTAATTGG TGATCTTATA GACGAAGGCG AAGAAGAA/^ ATTTGATAAA 250 

AAATGGTCGA TTCATAGAGA TCCGCCGGCA TTTGAAGATC AAAGCACAAA 300 

5 AAGTGAAATT TTTGAAACAG GTATAATVAGT CGTAGATCTT TTGGCTCCTT 350 

ATGCAAAAGG CGGTAAAGTT GGACTATTTG GCGGTGCCGG CGTTGGTAAA 4 00 

ACAGTTATCA TTATGGAACT TATCCACAAC GTTGCATTCA AACACAGCGG 4 50 

CTATTCGGTA TTTGCCGGTG TCGGTGAAAG AACAAGAGAG GGTAACGATC 500 

TTTATAATGA AATGAAAGAA TCCGGCGTTT TGGATAAAGT TGCCTTATGT 550 

10 TATGGACAAA TGAATGAACC GCCGGGTGCA AGAAACCGTA TAGCGCTTAC 600 

TGGTCTTACA ATGGCTGAGT ATTTTCGTGA CGAGATGGGA CTAGATGTTC 650 

TTATGTTTAT CGATAACATC TTCCGTTTCT CACAATCAGG CTCAGAGATG 7 00 

TCGGCTCTTC TTGGACGTAT CCCAAGTGCG GTTGGTTATC AACCAACGTT 750 

AGCTAGCGAA ATGGGAAGAC TTCAAGAAAG AATCACATCA ACT 7 93 



2) INFORMATION FOR SEQ ID NO: 1852 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 
30 (B) STRAIN: ATCC 33236 

(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 1852 

GGACTATTTA CCGAAGATTA ACGAAGCTAT CGAGGTTAAA TTTGACGTCG 50 

35 AGGGCGCTCA TCGCAGGCTG ATCCTAGAGG TAGCCGCGCA CCTTGGAGAC 100 

AATCGCGTCC GCACGATCGC TATGGATATG AGCGATGGAC TTAGGCGAGG 150 

GCTTGAGGCC GTCGCTTTGG GCGCGCCTAT TACGGTGCCT GTGGGCGAGA 200 

AAGTTTTGGG TAGAATTTTT AATGTTACGG GCGATCTGAT CGACGAAGGC 250 

GAGGATGAAA AATTTGAAAC CCGCTGGTCG ATCCACAGAG ATCCGCCTAG 300 

40 CTTTGAAAAT CAAAGCACGA AGAGTGAAAT TTTTGAAACC GGCATTAAGG 350 

TAGTCGATCT GCTCGCCCCT TATGCAAAGG GCGGTAAGGT AGGACTATTC 400 

GGCGGTGCTG GCGTCGGTAA GACCGTCATC ATCATGGAAC TGATTCACAA 450 

CGTCGCTTTC AAACACAGCG GCTACTCCGT ATTTGCGGGT GTCGGCGAGC 500 

GAACGAGAGA GGGAAACGAC CTTTATAACG AGATGAAAGA ATCGGGCGTT 550 

45 TTGGATAAAG TCGCCTTGAC CTATGGTCAG ATGAACGAAC CGCCGGGAGC 600 

GAGAAACCGT ATCGCGCTAA CCGGTCTTAC GATGGCCGAG TATTTCCGCG 650 

ACGAGCTAGG GCTTGACGTT TTGATGTTTA TTGATAATAT CTTCCGCTTC 700 

TCGCAGTCGG GTTCGGAGAT GTCCGCGCTT TTAGGACGAA TTCCGTCCGC 7 50 

GGTCGGTTAT CAGCCTACGC TTGCCAGCGA AATGGGTAAA TTACAGGAGC 800 

50 GCATTACTTC TACTAAGAAG GGCTC 825 
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2) INFORMATION FOR SEQ ID NO: 1853 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1853 

TTTACCTCAA ATTAATGAAG CAATTGTTGT AAATTTTGAA AGCGAAGGAA 50 

AAAAACATAA ACTTGTTTTA GAAGTAGCAG CTCATTTAGG AGATAATAGA 100 

GTTAGAACTA TTGCTATGGA TATGACAGAT GGTTTGGTAA GGGGCTTAAA 150 

AGCTGAGGCT TTAGGTGCTC CTATTAGTGT TCCTGTTGGT GAGAAAGTTT 200 

TAGG/^GAAT TTTCAATGTT ACTGGAGATT TGATCGATGA AGGTGAAGAA 250 

ATTTCTTTTG ATAAAAAATG GGCAATTCAT AGAGATCCGC CAGCTTTTGA 300 

AGATCAAAGC ACAAAAAGTG AGATTTTTGA AACAGGGATT AAAGTTGTAG 350 

ATTTGCTTGC TCCTTATGCA AAAGGTGGTA AAGTAGGTCT TTTTGGTGGT 400 

GCAGGTGTTG GTAAAACTGT TATTATTATG GAGCTTATTC ACAATGTTGC 4 50 

ATTTAAGCAT AGCGGCTATT CTGTATTTGC AGGTGTGGGT GAGAGAACTC 500 

GTGAAGGAAA TGACCTTTAT AATGAAATGA AAGAAAGTAA TGTTTTAGAC 550 

AAAGTTGCTC TATGTTATGG ACAAATGAAT GAACCACCAG GAGCAAGAAA 600 

TCGTATTGCT TTAACAGGTT TAACAATGGC TGAGTATTTT AGAGATGAAA 650 

TGGGTCTTGA TGTGCTTATG TTTATTGATA ATATCTTTAG ATTTTCACAA 700 

TCAGGTTCTG AAATGTCAGC ACTTTTAGGA AGAATTCCAT CAGCTGTGGG 750 

TTATCAACCA ACCCTAGCAA GTGAAATGGG TAAATTCCAA GAAAGAATTA 800 

CTTCAACTAA AAAAGGCT 818 



2) INFORMATION FOR SEQ ID NO: 1854 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1854 

ATTACCTGAT ATCAACAACG CCTTATTGGT CTATAAAAAT GATGAACAAA 50 

970 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



AAAGTAAAAT TGTGCTAGAA 
CGTACAATTG CCATGGAATC 
TGTCGATACT GGTAAACCAA 
GACGTGTCTT TAACGTTTTA 
5 GCACAAGATG CAGATCGTTC 
AGACTTAAGT ACAAGTACTG 
ACTTATTAGC ACCATATTTA 
GCCGGAGTAG GTAAAACCGT 
ACAAGAACAT GGTGGGATTT 

10 GTGAAGGAAA TGACTTGTAT 
AAAACTGCCA TGGTGTTTGG 
GCGTGTTGCT TTAACTGGGT 
AAGGACAAGA TGTGTTGCTA 
GCGGGTTCTG AAGTATCAGC 

15 TTATCAACCT ACATTGGCTA 
CTTCAACTAA GAAGGGCTCT 



GCTGCCTTAG AATTAGGTGA 

AACGGATGGT TTACAACGTG 

TTTCAGTTCC AGTTGGTAAA 

GGGGATACGA TTGATATGCA 

TGCAATTCAT AAAGCTGCAC 

AAATTTTAGA AACAGGGATT 

AAAGGTGGTA AAGTCGGTCT 

TTTAATCCAA GAATTAATCC 

CTGTATTTAC CGGTGTTGGT 

CATGAAATGC GTGATTCAGG 

TCAAATGAAC GAACCACCTG 

TAACGATTGC TGAATATTTC 

TTTATTGATA ACATCTTCCG 

CTTGCTTGGT CGTATGCCAT 

CAGAAATGGG TCAATTACAA 
ATCACTTCTA 



TGGCATCATT 100 

GGATGGAAGT 150 

GAAACGCTAG 200 

AGAACCATTT 250 

CAAAATTTGA 300 

AAAGTTATCG 350 

ATTCGGGGGT 4 00 

ATAATATTGC 4 50 

GAACGTACAC 500 

AGTTATTGAA 550 

GAGCTCGTAT 600 

CGTGATGTAG 650 

TTTCACTCAA 700 

CTGCCGTGGG 7 50 

GAACGTATCA 800 

830 



20 2) INFORMATION FOR SEQ ID NO: 1855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1855 



TCTTTACCAG ATATCAATAA TGCGCTTATT GTCTATAAAA ATGATGAACA 50 

AAAAAGTAAA ATCGTGCTTG AAGCTGCTTT AGAGCTAGGA GATGGCATTA 100 

TTCGTACGAT TGCAATGGAA TCAACTGATG GATTGCAACG TGGAATGGAA 150 

GTTTTCGATA CAGGTAAGCC AATTTCAGTA CCAGTAGGTC GTGAAACATT 200 

40 AGGTCGTGTA TTTAATGTTT TAGGTGATAC CATTGATACG CAAGAAGCTT 250 

TTCCTGCTGA TGCGAATCGT GATGCGATTC ATAAATCAGC TCCAGCTTTT 300 

GAAGAATTAA GTACAAGTAC TGAAATCCTA GAAACAGGGA TTAAAGTTAT 350 

CGACTTACTA GCACCATACT TAAAAGGTGG GAAAGTTGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACC GTATTAATTC AAGAATTAAT TCATAATATC 450 

45 GCCCAAGAAC ATGGGGGTAT TTCAGTATTT ACCGGTGTTG GTGAACGTAC 500 

ACGTGAAGGA AATGACTTGT ATCACGAAAT GCGTGATTCA GGCGTTATCG 550 

AAAA7ACTGC TATGGTGTTT GGGCAAATGA ACGT^CCACC TGGAGCACGT 600 

ATGCGTGTTG CGCTAACTGG ACTAACTATT GCGGAATACT TCCGTGATGT 650 

TGAAGGCCAA GACGTATTGC TATTTATTGA TAATATCTTC CGTTTTACTC 700 

50 AAGCAGGTTC TGAAGTTTCT GCCTTACTTG GTCGTATGCC TTCTGCGGTA 750 

GGTTATCAAC CTACTTTGGC TACTGAAATG GGTCAATTGC AAGAACGGAT 800 

TACATCAACG AAGAAAGGTT CGA 823 
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2) INFOR^4ATION FOR SEQ ID NO: 1856 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1856 

TTACCAGACA TTAATAATGC CTTGGTTGTC TATAAAAATG ACGAACAAAA 50 

AACCAAGATT GTATTAGAAG CTGCCTTAGA ACTAGGAGAT GGTGTGATTC 100 

GAACTATCGC CATGGAATCT ACTGATGGCT TACAACGGGG AATGGAAGTT 150 

20 GTCGATACTG GCAGTTCCAT TTCTGTACCG GTAGGAAAAG AAACATTGGG 200 

TCGTGTATTT AACGTTTTAG GAAATACAAT TGACTTAGAA GAACCTTTTC 250 

CAGCGGATGC TAAACGTAGT GGTATCCATA AAAAAGCGCC TGATTTTGAT 300 

GAATTAAGCA CTAGTACAGA AATTTTAGAA ACAGGGATTA AAGTTATTGA 350 

CCTATTAGCC CCTTATTTAA AAGGTGGTAA AGTCGGATTA TTCGGTGGTG 400 

25 CCGGAGTTGG TAAAACCGTT TTAATTCAAG AATTAATTCA TAATATTGCC 450 

CAAGAACATG GTGGGATTTC TGTTTTTACT GGTGTTGGTG AAAGAACACG 500 

TGAAGGTAAT GACTTGTATT ATGAAATGAA AGAATCTGGC GTTATCGAAA 550 

AAACTGCCAT GGTATTTGGT CAAATGAATG AGCCACCTGG TGCCCGGATG 600 

CGGGTTGCTT TAACCGGACT TACCATTGCG GAATACTTCC GGGACGTTGA 650 

3 0 AGGACAAGAT GTATTGCTCT TTATCGATAA TATTTTCCGT TTTACCCAAG 700 

CTGGTTCAGA AGTATCTGCC TTATTAGGAC GGATGCCCTC TGCCGTTGGT 750 

TATCAACCAA CTTTGGCTAC TGAAATGGGA CAACTTCAAG AACGGATTAC 800 

CTCAACGA/y^. AAAGGTTCTA TTACAT 826 

35 

2) INFORMATION FOR SEQ ID NO: 1857 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1857 

TCCTTACCAG ACATCAACi\A TGCGTTGATT GTTTACAAAA AAAATAAAAC 50 
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AAAAGTTGTT CTTGAAGCTG CTTTGGAACT 

CGATCTCTAT GGAATCAACA GATGGCTTGC 

GATACAGGCA AACCAATCTC AGTTCCCGTT 

TGTGTTTAAC GTATTAGGTG AJ\ACAATCGA 

5 AAGATGCAGT AAAAAGCGGT ATTCATAAAA 

CTTAGTACCA GTAATGAAAT TTTAGAAACA 

ATTAGCTCCT TACTTAAAGG GTGGTAAAGT 
GTGTTGGTAA AACCGTCTTG ATCCAAGAAT 

GAACACGGTG GTATTTCAGT GTTTACGGGT 

10 AGGGAACGAC CTTTATTATG AAATGAAGGA 

CTGCCATGGT GTTTGGACAA ATGAACGAGC 
GTTGCCTTGA CTGGTTTGAC ATTGGCTGAA 

ACAAGATGTG CTGTTGTTTA TCGACAACAT 

GTTCTGAAGT TTCTGCCTTG CTTGGCCGGA 

15 CAACCAACTT TGGCAACTGA AATGGGTCAA 

AACGAAGAAG GGCT 



PCT/CAOO/01150 

TGGTGATGGT GTTATCCGCA 100 

AACGTGGAAT GGAAGTTGTC 150 

GGTAAAGAAA CTTTAGGTCG 200 

CAAAGAAGCG CCTTTTCCAG 250 

AAGCGCCGGC TTTTGAAGAA 300 

GGGATCAAAG TTATCGACTT 350 

CGGACTATTT GGTGGTGCCG 400 

TGATTCATAA TATCGCCCAA 450 

GTTGGTGAAC GTACTCGTGA 500 

ATCAGGCGTT ATTGAGAAAA 550 

CGCCAGGTGC GCGTATGCGT 600 

TATTTCCGAG ATGAAGAAGG 650 

CTTCCGTTTC ACTCAAGCCG 700 

TGCCTTCAGC CGTTGGCTAC 750 

TTGCAAGAAC GAATCACTTC 800 

814 



20 2) INFORMATION FOR SEQ ID NO: 1858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtil 

(B) STRAIN: ATCC 43186 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1858 



CGCATTAGTT GTTTATAAAA ATGATGAGCA AAAATCAAAA GTTGTTCTTG 50 

AAGCAGCATT AGAATTAGGT GACGGTGTGA TCCGTACGAT CGCAATGGAA 100 

TCGACGGATG GACTACAACG TGGAATGGAA GTCATCGACA CAAGCAAAGC 150 

GATCTCTGTA CCAGTTGGAA CAGAAACATT AGGTCGTGTG TTCAACGTGT 200 

40 TAGGTGAT^C AATCGATTTG GAAGCACCAT TTCCAGAGGA TGCCCAAAGA 250 

AGCGAGATCC ACAAGAAAGC ACCAAATTTT GATGAATTAA GCACAAGTAC 300 

AGAGATTCTT GAAACTGGGA TCAAAGTCAT TGACTTATTA GCACCTTATT 350 

TAAAAGGTGG GAAAGTTGGA TTGTTTGGGG GTGCCGGTGT TGGTAAAACC 400 

GTACTGATCC AAGAATTGAT CCATAATATC GCCCAAGAAC ATGGGGGAAT 450 

45 CTCAGTGTTT ACCGGTGTAG GGGAACGTAC CCGTGAAGGA AACGATCTGT 500 

ATTACGAAAT GAAAGATTCA GGCGTAATCG AAAAAACAGC GATGGTGTTT 550 

GGACAAATGA ATGAGCCACC AGGTGCTCGT ATGCGTGTCG CACTAACTGG 600 

ATTGACGATT GCGGAATATT TCCGTGATGT CGAAGGACAA GACGTGCTCT 650 

TATTTATTGA TAATATTTTC CGTTTCACCC AAGCAGGTTC AGAAGTATCT 700 

50 GCCTTACTAG GACGTATGCC ATCAGCGGTT GGTTATCAAC CAACCTTAGC 750 

GACTGAAATG GGACAACTCC AAGAACGGAT CACTTCAACG A 791 
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2) INFORMATION FOR SEQ ID NO: 1859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1859 

TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTATAAAA AAGATAAAAC 
AAAAGTTGTT CTTGAAGCTG CTTTGGAACT TGGTGATGGT GTTATTCGCA 
CAATCGCCAT GGAATCAACG GATGGATTAC AACGTGGAAT GGAAGTTGTC 
GATACTGGCA AGCCTATTTC TGTTCCAGTA GGAAAAGAAA CTCTAGGTCG 
TGTATTTAAT GTATTAGGTG AAACAATCGA CAAGGAAGCG CCTTTTCCAG 
AAGATGCAGA ;\AAAAGTGGT ATTCACAAGA AAGCACCAAC TTTCGAAGAA 
CTTAGCACAA GTAATGAGAT CTTAGAAACA GGAATCAAAG TTATTGACTT 
GTTAGCTCCT TACTTAAAAG GTGGTAAAGT TGGATTATTT GGTGGTGCCG 
GTGTTGGTAA AACAGTCTTG ATTCAAGAGC TAATTCATAA TATCGCTCAA 
GAACATGGTG GTATTTCTGT GTTTACTGGT GTTGGTGAAC GTACTCGTGA 
AGGGAACGAC CTTTATTATG AAATGAAAGA TTCTGGTGTT ATTGAGAAAA 
CTGCTATGGT GTTCGGTCAA ATGAACGAGC CGCCAGGTGC ACGTATGCGT 
GTTGCCTTAA CTGGTTTAAC CTTAGCCGAA TACTTCCGTG ATGAAGAAGG 
ACAAGATGTG TTGCTATTTA TTGACAACAT TTTCCGTTTC ACTCAAGCCG 
GATCAGAAGT TTCTGCCTTA CTTGGCCGTA TGCCGTCAGC AGTTGGTTAC 
CAACCGACTT TAGCAACTGA AATGGGTCAA TTACAAGAAC GTATTACGTC 
GACGAAAAAA GGTTCAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
817 



2) INFORMATION FOR SEQ ID NO: 1860 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 

(B) STRAIN: ATCC 51173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1860 

CCTGACATTC ATAATGCATT AATTGTAACG AACGCTGATA TGGCGGATGT 50 
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AATGCAAGAA AATATTTCGG ATGAAGAAAA ATTATTAACC TTAGAAGTTG 100 

CACTGGATTT AGGTCATGGA ATGGTCCGGA CAATTGCGAT GGAATCAACC 150 

GATGGTTTGG AACGCGGCAT GACAGTTGTG GATTATTTAA CACCGATTAA 200 

AGTGCCAGTA GGCGAAGCCA CTTTAGGTAG AGTATTCAAT GTTTTAGGTG 250 

5 AGACAATTGA TGAACTAGAA CCGGTTGGCG ACGATGTTGA ACTCAAAAGT 300 

ATTCATCGTG AAGCCCCTAA ATATGAGGAC TTAGATAATA GTTTTCATGT 350 

TTTAGAAACC GGAATTAAGG TCATCGATTT ATTAGCTCCT TATATTAAAG 400 

GGGGAAAAAT CGGTTTATTC GGTGGTGCCG GAGTGGGTAA AACGGTCTTA 4 50 

ATTCAAGAAT TAATTCATAA TATTGCAGAA CAATTAGGAG GTATCTCAGT 500 

10 TTTCACTGGG GTTGGAGAAC GTACCCGTGA AGGGAATGAC CTCGTTTTTG 550 

A7y\TGCGAGA GTCAGGTGTA AGCAAGAAGA CGGCCATGGT TTTCGGTCAA 600 

ATGAATGAAC CACCTGGAGC ACGTATGCGT GTTGTCTTAA CAGGACTTAC 650 

AATGGCGGAA TATTTCCGTG ACGAATTGAA ACAAGACGTC TTATTATTTA 700 

TTGATAATAT TTATCGTTTT ACTCAAGCAG GTTCCGAAGT GTCAGCCTTA 750 

15 TTAGGTCGTA TGCCTTCAGC AGTAGGGTAT CAACCAACTT TAGCAAGTGA 8 00 

AATGGGACAA ATGCAAGAAC GTATTACGTC WACGAAGCRC GGTTCCATTA 850 

CA 852 



20 



30 



2) INFORMATION FOR SEQ ID NO: 1861 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1861 

GCGCGACTCT TCCTGAGATT AATAACGCAC TCATCGTTTA CAAAGATGTA 50 

GACGGCGTTA AAACTAAAAT CGTCCTTGAA GTGGCGTTGG AACTTGGTGA 100 

TGGTGCCGTA CGTACCATCG CTATGGAATC AACTGATGGC TTGACACGTG 150 

40 GACTTGAAGT TCTCGATACA GGTAAAGCAA TCAGCGTACC TGTTGGTCAA 200 

GAAACACTTG GACGTGTCTT CAATGTACTT GGAGATGCTA TTGATGGAGG 250 
■ GGAAGCATTT GCTGAAAATG CAGAACGCAG CCCTATCCAT AAAAAAGCCC ' 300 

CATCTTTTGA TGAACTTTCA ACAGCAAATG AAATTCTGGT GACAGGGATT 350 

AAAGTTATTG ACTTGCTTGC CCCATACCTT 7\AAGGTGGTA AGATTGGGTT 400 

45 GTTCGGTGGT GCCGGAGTTG GTAAAACCGT CCTTATCCAA GAGTTGATTC 450 

' ACAATATTGC CCAAGAACAC GGTGGTATTT CCGTATTTAC TGGTGTTGGG 500 

GAACGTACAC GTGAAGGGT^ TGACCTTTAC TGGGAAATGA AAGAATCAGG 550 

CGTTATCGAA AAAACAGCCA TGGTCTTCGG TCAAATGAAT GAACCACCTG 600 

GAGCACGTAT GCGTGTTGCT CTTACTGGTT TGACAATTGC TGAATATTTC 650 

50. CGTGATGTAG AAAAACAAGA CGTTTTGCTT TTCATTGATA ATATCTTCCG 7 00 

TTTCACCCAA GCCGGTTCAG AAGTATCTGC CCTCTTAGGA CGTATGCCAT 750 

CAGCCGTTGG TTACCAACCT ACGCTTGCAA CTGAAATGGG TCAACTTCAA 800 

GAACGTATCA CTTCAACAAA ACAAGGTT 828 
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2) INFORMATION FOR SEQ ID NO: 1862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(B) STRAIN: ATCC 11454 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1862 

AATTGCCTGA RATTAACAAT GCCTTGATTG TCTACAAAGA TGTCAATGGC 50 

CTAAAAACAA AAATTACTCT TGAAGTTGCT TTGGAACTTG GTGATGGTGC 100 

AGTTCGTACA ATCGCTATGG AATCTACTGA TGGCTTGACT CGTGGACTTG 150 

AAGTCCTTGA TACAGGTAAA GCAGTCAGCG TTCCTGTTGG GGAAGCCACT 200 

CTTGGTCGTG TTTTTAACGT TCTTGGTGAT GTTATTGACG GTGGGGAAGA 250 

ATTTGCTGCT GATGCAGAAC GTAATCCTAT CCATAAAAAA GCTCCAACAT 300 

TTGACGAATT GTCAACTGCA AACGAAGTTC TCGTAACTGG GATTAAAGTT 350 

GTCGATTTGC TTGCACCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCCTTAT TCAAGAATTG ATTCACAACA 4 50 

TCGCCCAAGA ACACGGAGGT ATTTCTGTGT TTACCGGTGT TGGGGAACGT 500 

ACTCGTGAAG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

TGAAAAAACT GCCATGGTCT TTGGTCAAAT GAATGAACCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCTGAATA TTTCCGTGAT 650 

GTTCAAGGTC AAGACGTACT GCTTTTCATT GACAACATCT TCCGTTTCAC 700 

ACAAGCTGGT TCAGAAGTTT CTGCCCTTTT GGGACGTATG CCTTCTGCCG 750 

TTGGTTACCA ACCAACACTT GCTACTGAAA TGGGGCAATT GCAAG2\ACGT 800 

ATCACTTCTA CTAAGAAGGG TTCTGTTA 828 



2) INFORMATION FOR SEQ ID NO: 1863 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1863 

CTTACCTGAA ATCTACAACG CCCTAGTTAT TGAATATAAA TCTGATGCAG 50 
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AAGT^GCACC AACTAGCCT^A CTTACTTTAG AAGTAGCCAT TCAATTAGGT 100 

GATGACGTTG TTCGTACAAT TGCAATGGCA TCAACTGATG GTGTTCAAAG 150 

AGGTATGGAA GTTATTGATA CTGGGAGCCC AATCACAGTT CCAGTTGGTA 200 

CAGTAACACT TGGTCGTGTA TTTAACGTAT TAGGAAACAC TATCGATTTG 250 

5 GATGAGCCAC TTCCAAGCGA TATCAAGCGT AATAAAATTC ACCGTGAAGC 300 

ACCAACATTT GACCAATTAG CAAC7VACTAC TGAAATTCTT GAAACAGGAA 350 

TAAAAGTTGT AGACTTGCTA GCCCCATATT TAAAAGGTGG TAAT^TTGGT 400 

TTGTTCGGCG GAGCGGGTGT TGGTAAAACC GTTTTAATCC AAGAACTTAT 450 

TCATAATATC GCTCAAGAAC ATGGTGGTAT TTCTGTGTTC GCTGGTGTTG 500 

10 GAGAACGTAC TCGTGAAGGG AACGATCTTT ACTTTGAAAT GAAAGACTCT 550 

GGTGTAATTG T^AAAT^CTGC CATGGTATTC GGTCAAATGA ACGAACCACC 600 

AGGTGCTCGT ATGCGTGTAG CTTTAACAGG TCTTACAATC GCTGAATATT 650 

TCCGTGATGA AGAACACCAA GATGTACTTC TATTCATTGA TAATATTTTC 700 

CGCTTTACTC AAGCTGGTTC AGAGGTTTCG GCTTTACTAG GTCGTATGCC 750 

15 ATCTGCAGTA GGTTATCAAC CAACTCTAGC TACCGAAATG GGACAATTAC 8 00 

AAGAACGTAT TACTTCTACT AATGT 825 



20 2) INFORMATION FOR SEQ ID NO: 1864 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivihrio dextrinosolv^ens 

(B) STRAIN: ATCC 19716 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1864 



GTCATATTCC AGAGCTTTAT GACGCTCTGG AGGTTAAGGG CGATGGTAAG 50 

CATCGTTCAG ACCTAGTTCT TGAGGTTCAA CAGCAGATTG GCGGTGGTGT 100 

GGTACGCTGC ATTGCCATGG GTTCTTCTGA CGGTTTGAGC AGAGGAATTG 150 

AGGCTGTAAA TACTGGTGCC GGTGTTAAGG TTCCAGTTGG TCGTGAGACC 200 

40 CTAGGACGTA TTATGAACGT TTTAGGTCAG CCTGTAGATG AGAGAGGTCC 250 

TATCGGACAG AAAGAGGATT GGGAAATTCA CCGTCCAGCT CCTACCTATG 300 

CTGAGCAGTC ATCAACTACA GAAATTCTAG AAACCGGTAT TAAGGTTATG 350 

GACCTTATCT GCCCATTTGC TAAGGGTGGT AAAGTTGGTC TGTTCGGTGG 400 

TGCCGGTGTG GGTAAGACAG TTAACATGAT GGAGCTTATC AATAACATTG 450 

^ 45 CTAAGGCTCA CTCAGGTCTA TCTGTATTTA CCGGTGTTGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG CAGGAATCAA AGGTTATCGA 550 

TAAGGTATCA ATGATTTACG GTCAGATGAA CGAGCCTCCA GGGAACCGTC 600 

TACGTGTTGC TCTGACAGGT CTGACTGTTG CTGAGAAGTT CCGTGACGAA 650 

GGTCTGGATG TGCTTCTGTT CATCGATAAC ATCTATCGTT ATACACTGGC 700 

50 TGGTACAGAG GTATCTGCTC TGTTAGGCCG TATGCCTTCA GCTGTGGGTT 750 

ACCAGCCTAC ACTGGCTGAG GAAATGGGTG TATTACAGGA GCGTATTGCT 800 

TCAACTAAGA AAGGTTCTAT T 821 
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2) INFORMATION FOR SEQ ID NO: 1865 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1865 

TGATACTTTA CCAGATATCA ATAATGCATT AGCCGTATAT AAAAATGATG 50 

AGAACAAGAC GCGTGTTGTA TTGGAAGCTA CTTTAGAACT TGGAGATGGG 100 

GTAATTCGTG CCATTTCTAT GGGGTCTACT GACGGCTTGC AACGTGGCAT 150 

GGAAGTTGTG GATACACAAG AACCTATTTC TGTTCCGGTA GGAAATGATA 200 

CTTTAGGTCG TGTATTTAAT GTGTTAGGAG AAACAATAGA TAATCAGGAG 250 

CCATTTCCTG AAGATGCTGA AAAAAGTGGT ATTCACAAAA AAGCCCCTAG 300 

TTTTGATGAA TTAAGTACTA GTTCGGAAAT ATTAGAAACA GGGATCAAAG 350 

TGATTGATTT ATTAGAACCT TATCTAAGAG GCGGTAAAGT CGGATTGTTT 4 00 

GGAGGCGCCG GTGTTGGAAA AACGGTGCTA ATTCAAGAAT TGATCAATAA 4 50 

TGTTGCCCAA GAACACGGGG GTATTTCCGT GTTTAATGGT GTAGGTGAAC 500 

GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGCAGGA TTCAGGCGTT 550 

ATCGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC CACCAGGTGC 600 

TCGTATGCGT GTTGCTTTAA CTGGCCTAAC ACTGGCAGAA TATTTTCGAG 650 

ATGTTGAAGG TCAAGACGTA TTATTATTTA TTGATAATAT TTTCCGTTTT 700 

ACACAAGCAG GTACCGAAGT TTCCGCTTTA CTTGGTAGAA TGCCATCTGC 750 

TGTTGGCTAT CAACCCACAC TAGCAACTGA AATGGGGCAA CTGCAAGAAC 800 

GGATTACGTC AACGGATAAG GG 822 



2) INFORMATION FOR SEQ ID NO: 1866 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1866 

ATCTCCTCAG GATCKATAGG ACTTGATATA GCTCTTGGTA TAGGCGGCGT 50 

978 
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ACCAAAAGGA AGAATAGTCG 
CAACTCTTAC TTTGCATTTA 
TGCGCGTTTG TAGATGCAGA 
TTTAGGCGTT GATACGGATA 
5 AGCAAGCTCT TGATATAGTA 
CTTATAGTAA TAGATAGCGT 
AGGCGATATG GGAGATCAGC 
AAGCACTTAG AAAATTAACC 
GTATTTATAA ACCAAATTCG 

10 TCCTGAAACT ACTACTGGCG 
GACTTGACGT ACGTAAAATA 
GGAAACCGCG TAAAAGTAAA 
TAGACAAGCC GAATTTGATA 
GAGAGATAAT AGATTACGGC 

15 GCTTGGTTTA GCTATGATAA 
AAAAGCGTTT TTAAAAGA 



AAATTTATGG GCCAGAAAGC 
ATAGCAGAAT CTCAAAAAGT 
GCATGCACTT GATGTTAAAT 
ACTTATATAT TTCTCAACCG 
GAAACTCTAG CTAGAAGCGG 
AGCAGCTYTA ACACCAAAAA 
ACGTAGGGCT GCAAGCAAGA 
GGAGTTGTCC ATAAAATGGG 
TATGT^AAATC GGCGCTATGG 
GAAATGCGCT TAAATTTTAC 
GCTACTTTAA AACAGAGCGA 
AGTAGTAAAA AACAAAGTCG 
TCATGTTTGG AGAAGGTATC 
GTAAAACTTG ATATTATCGA 
TTCAAAATTA GGTCAAGGCA 



TCTGGTAAAA 


100 


CGGCGGAGTT 


150 


ATGCTAAAAA 


200 


GACTTCGGAG 


250 


CGCCGTTGAT 


300 


GCGAAATAGA 


350 


CTCATGAGTC 


400 


AACTACAGTT 


450 


GCTATGGCAC 


500 


GCTTCAGTTA 


550 


TGAGCCAATC 


600 


CTCCTCCTTT 


650 


AGCAAAGAAG 


700 


TAAAAGCGGC 


750 


GAGAAAACTC 


800 




818 



20 2) INFORMATION FOR SEQ ID NO: 1867 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fBtus subsp. venerealis 

(B) STRAIN: ATCC 33561 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1867 



TCTCCTCAGG ATCKATAGGA CTTGATATAG CTCTTGGTAT AGGCGGCGTA 50 

CCAAAAGGAA GAATAGTCGA T^TTTATGGG CCAGAAAGCT CTGGTAAAAC 100 

AACTCTTACT TTGCATTTAA TAGCAGAATC TCAAAAAGTC GGCGGAGTTT 150 

GCGCGTTTGT AGATGCAGAG CATGCACTTG ATGTTAAATA TGCTAAAAAT 200 

40 TTAGGCGTTG ATACGGATAA CTTATATATT TCTCAACCGG ACTTCGGAGA 250 

GCAAGCTCTT GATATAGTAG AAACTCTAGC TAGAAGCGGC GCCGTTGATC 300 

TTATAGTAAT AGATAGCGTA GCAGCTYTAA CACCAAAAAG CGAAATAGAA 350 

GGCGATATGG GAGATCAGCA CGTAGGGCTG CAAGCAAGAC TCATGAGTCA 400 

AGCACTTAGA AAATTAACCG GAGTTGTCCA TAAAATGGGA ACTACAGTTG 450 

45 TATTTATAAA CCAAATTCGT ATGAAAATCG GCGCTATGGG CTATGGCACT 500 

CCTGAAACTA CTACTGGCGG AAATGCGCTT AAATTTTACG CTTCAGTTAG 550 

ACTTGACGTA CGTAAAATAG CTACTTTAAA ACAGAGCGAT GAGCCAATCG 600 

GAAACCGCGT AAAAGTAAAA GTAGTAAAAA ACAAAGTCGC TCCTCCTTTT 650 

AGACAAGCCG AATTTGATAT CATGTTTGGA GAAGGTATCA GCAAAGAAGG 700 

50 AGAGATAATA GATTACGGCG TAAAACTTGA TATTATCGAT AAAAGCGGCG 750 

CTTGGTTTAG CTATGATAAT TCAAAATTAG GTCAAGGCAG AGAAAACTCA 800 

AAAGCGTTTT TAAA 814 
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2) INFORMATION FOR SEQ ID NO: 1868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33560 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1868 

GATAGCATAG GCWCAGGTTC AGTTGGACTT GATCTTGCTT TAGGTATAGG 50 

CGGTGTTCCA AAAGGAAGAA TTATAGAAAT TTATGGGCCT GAAAGTTCAG 100 

GTAAAACTAC TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT 150 

20 GGGGTTTGTG CTTTTATCGA TGCAGAACAT GCACTTGATG TGAAATATGC 200 

TAAAAATTTA GGTGTAAATA CAGATGATTT GTATGTTTCT CAACCTGATT 250 

TTGGAGAGCA AGCCTTAGAA ATTGTAGAAA CTATAGCWAG AAGTGGTGCA 300 

GTAGATCTTA TWGTAGTAGA TAGCGTTGCA GCWCTTACCC CAAAAGCAGA 350 

AATTGAAGGC GATATGGGCG ATCARCATGT AGGACTTCAA GCAAGACTTA 400 

25' TGTCTCAAGC TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC 450 

ACAGTAATTT TCATCAACCA AATTCGTATG AAAATCGGTG CTATGGGTTA 500 

TGGTACTCCT GAAACCACAA CAGGTGGAAA TGCATTAAAA TTTTATGCTT 550 

CTGTGCGTTT AGATGTTAGA AAAGTAGCAA CCTTAAAMCA AAACGWAGAM 600 

CCTATAGGAA ACCGCGTTAA AGTAAAAGTA GTTAAAAATA AAGTTGCTCC 650 

30 TCCATTCAGM CAAGCTGAAT TTGATGTGAT GTTTGGAGAG GGTTTAAGCC 700 

GTGAAGGTGA ATTGATCGAT TATGGTGTAA AACTTGATAT CGTAGATAAA 750 

AGTGGTGCGT GGTTTTCTTA TAAAGATAAA AAACTTGGAC AAGGTAGAGA 800 

AAATTCAAAA GCTTTCTTAA AAGA 824 

35 

2) INFORMATION FOR SEQ ID NO: 1869 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1869 

AGAAAGTTCT GGTAT^AACAA CGGTTGCACT GCATGCGATT GCAGAAGTTC 50 

980 
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AAAAACATGG CGGGACGGCA GCCTTTATTG 
CCTCAATACG CACAACGTCT AGGTGTAAAC 
ACAACCAGAT ACTGGGGAAC AAGGCTTAGA 
CAAGTGGCGC AGTCGATATT ATCGTTATTG 
5 CCCCGTGCTG AAATCGATGG CGAGATGGGT 
GGCTCGTTTG ATGTCACAAG CATTGCGCAA 
AAACAAAGAC TATCGCCGTC TTTATTAACC 



PCT/CAOO/01150 

ATGCCGAGCA CGCGTTGGAC 100 

ATTGATGAAT TGCTGCTATC 150 

AATTGCAGAT GCTTTAGTTT 200 

ACTCGGTGGC CGCGCTAGTC 250 

GATGCGCACG TTGGTCTGCA 300 

GCTGTCAGGC TCTATCAACA 350 

TW^TTCGT 388 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1870 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 388 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1870 

TGAAAGTTCA GGTT^AAACAA CAGTTGCACT ACACGCTATT GCAGAAGTAC 50 

AAAAAAATGG CGGAACGGCC GCTTTCATTG ATGCTGAGCA TGCGTTAGAT 100 

CCGCAATATG CACAAAAATT AGGTGTGAAT ATCGATGAAC TACTTCTTTC 150 

30 ACAGCCTGAC ACAGGAGAAC AAGGTCTAGA GATCGCTGAT GCTTTAGTAT 200 

CAAGTGGGGC TGTAGATATC GTAGTAGTCG ATTCAGTTGC TGCTTTAGTT 250 

CCACGAGCAG AAATCGACGG CGAAATGGGT GACTCACATG TCGGGTTACA 300 

AGCACGTTTG ATGTCTCAAG CATTGCGTAA ACTCTCTGGT TCGATCAACA 350 

AAACAAAAAC AATCGCTATT TTCATCAACC AAATCCGT 388 

35 



2) INFORMATION FOR SEQ ID NO: 1871 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



i5 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 
50 (B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1871 
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10 



25 



AGAGAGTTCC GGTAAAACAA CTGTTGCGCT TCATGCAATT GCGGAAGTAC 50 

AAGCACAAGG CGGAACAGCA GCATTTATCG ATGCTGAGCA TGCGTTGGAT 100 

CCGGCTTATG CTAAAAACCT AGGTGTAAAT ATTGATGAAT TATTACTATC 150 

TCAACCAGAT ACAGGAGAAC AAGCTTTAGA GATTGCTGAA GCTTTAGTTA 200 

GAAGTGGTGC AGTTGATATG TTAGTAATTG ACTCCGTTGC AGCACTTGTA 250 

CCACGTGCTG AAATCGAAGG CGAGATGGGC GATGCTCATG TTGGATTACA 300 

AGCACGTTTA ATGTCCCAAG CATTGCGTAA ACTTTCTGGT GTTATTAATA 350 

AATCAAAAAC CATTGCTATT TTCATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1872 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1872 



AGAGTCATCT GGTAAGACAA CGGTTGCCCT TCATGCAGTT GCGCAAGCAC 50 

AAAAAGAAGG TGGTATTGCT GCCTTTATCG ATGCGGAACA TGCCCTTGAT 100 

30 CCAGCTTATG CTGCGGCCCT TGGTGTCAAC ATTGACGAAT TGCTCTTGTC 150 

ACAACCAGAC TCAGGAGAGC AAGGTCTTGA GATTGCAGGA AAATTGATTG 200 

ACTCAGGAGC CGTGGATCTT GTCGTAGTCG ACTCAGTTGC GGCCCTTGTC 250 

CCTCGTGCGG AAATTGATGG AGATATCGGT GATAGCCACG TTGGTTTGCA 300 

GGCTCGTATG ATGAGCCAGG CTATGCGTAA ACTTGGTGCT TCTATCAATA 350 

35 AAACCAAAAC AATTGCCATC TTTATCAACC AATTGCGT 388 



2) INFORMATION FOR SEQ ID NO: 1873 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 . (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1873 

982 
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GAACATGACG CCGACTTTTT 
TTTTGGTTTT ATTGATAGAA 
ATTCGAGCCT GCAAACCAAC 
5 CGCACGAGGT ACAAGGGCCG 
CACCTGAGTC AATCAATTTT 
TCTGGTTGTG ACAAGAGCAA 
ATAGGCTGGG TCAAGAGCAT 
CTTCTTTCTG TGCTTGCGCA 
LO GATGATTCTG GCGCRTACAY 



CACGCAATTG GTTGATAAAG 

GCTCCGAGTT TACGCATGGC 

GTGACTGTCT CCAATATCCC 

CAACTGAGTC GATAACGACA 

CCAGCAATTT Ci\AGACCTTG 

TTCGTCAATA TTCACACCAA 

GTTCCGCATC GATAAAGGCT 

ACAGCGTGAA GGGCAACCGT 
TTCGATGATA 



ATGGCAATTG 50 

CTGGCTCATC 100 

CATCAATTTC 150 

AGGTCAACTG 200 

TTCACCTGAG 250 

GGGCTGCAGC 300 

GCAATACCAC 350 

TGTCTTACCA 400 

430 



2) INFORMATION FOR SEQ ID NO: 1874 



L5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 947 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: AspGrgillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1874 

30 TTGTCGTTGT TGCTGCCTCC GACGGTCAGA TGTAGGTGGA ACATCTTGGG 50 

AAATACGTCG TAAAACACGT CGCTTACGTT TTCGCGAATA GGCCCCAGAC 100 

TCGTGAGCAT TTGCTGCTCG CCCGCCAGGT TGGTGTCCAG AAGATCGTTG 150 

TCTTCGTCAA CAAAATCGAT GCTATTGATG ATCCGGAGAT GCTGGAACTG 200 

GTCGAACTCG AGATGCGTGA GCTGCTGAAC AGCTACGGTT TCGAGGGTGA 250 

IS AGAGACTCCG ATCATTTTCG GTTCCGCTCT CTGTGCTCTC GAAGGACGCC 300 

GTGACGACAT CGGTAAAGAC AGAATTGAGC AGCTTATGAA CGCTGTCGAC 350 

ACCTGGATCC CCACTCCTCA GCGTGACCTC GACAAACCTT TCTTGATGTC 400 

TGTCGAGGAA GTGTTCTCTA TCGCCGGCCG TGGTACCGTG GCTTCTGGTC 4 50 

GTGTCGAGCG TGGTATCTTG AAGAAGGACT CTGAGGTTGA GATTGTTGGA 500 

10 GGCTCCTTCG AACCCAAGAA GACCAAAGTC ACCGACATTG AAACCTTCAA 550 

GAAGAGCTGT GATGAATCGC GTGCTGGTGA CAACTCTGGT CTCCTCCTGC 600 

GTGGTATCCG ACGTGAAGAC GTCAAGCGTG GTATGGTCAT TGCTGTTCCC 650 

GGCAGCACCA AGGCTCACGA CAAGTTCCTC GTCTCCATGT ACGTCCTGAC 700 

CGAGGCGGAG GGTGGTCGTC GTACTGGCTT CGGTGCCAAC TACCGTCCCC 750 

15. AAGTCTTCAT CCGTACTGCA GGTAAGTTCC CGCACACCGT GTCCAGATCT 800 

TCCGAGAGAT TAGCGATATA TGCTAATGAT TCATCAGACG AGGCTGCTGA 850 

CCTCAGCTTC CCTGACGGCG ACCAATCTCG CAGAGTTATG CCTGGTGACA 900 

ACGTCGAGAT GATCCTGAAG ACCCACCACC CTGTTGCTGC TGAGGCT 947 



2) INFORMATION FOR SEQ ID NO: 1875 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1875 

15 GCTGCTTCCG ATGGTCAAAT GTACGTCAAC CTTATATACA CCCTCTGATA 50 

TGACAGAATG TCGCCTAACA GCACGCGTGA ACTAGGCCCC AAACCCGTGA 100 

GCACTTGCTG CTTGCCCGCC AGGTCGGTGT CCAGAAGATT GTTGTGTTCG 150 

TCAACAAGGT TGATGCCGTC GATGACCCTG AGATGTTGGA ACTTGTTGAG 200 

CTGGAAATGC GTGAGCTTCT CAGCACTTAC GGCTTCGAAG GCGAGGAGAC 250 

20 CCCTATCATC TTCGGTTCCG CCCTATGCGC CCTCGAGGGT CGCCGCCCCG 300 

ATATCGGTAC TGAGCGAATT GACAGCCTTC TTGAGGCCGT TGACACCTGG 350 

ATCCCTACCC CTCAGCGTGA CCTGGACAAG CCTTTCCTGA TGTCTGTCGA 400 

GGAGGTCTTC TCCATTGCCG GTCGTGGTAC CGTTGCCTCT GGCCGTGTTG 450 

AGCGTGGTCT CCTTAAGAAG GACAGCGAGG TCGAGATTCT CGGAGGTGGT 500 

25 CAGGTCATGA AGACCAAGGT CACTGACATT GAGACATTCA AGAAGCACTG 550 

TGACGAATCC CGTGCTGGTG ACAACTCCGG TCTTCTTCTC CGTGGTATCC 600 

GCCGTGAGGA TGTCAAGCGC GGTATGGTTA TTGCTGCTCC CGCCTCTATC 650 

AAGGCCCACA AGAAGTTCAT GGTCTCCATG TACGTCCTCA CTGAGGCAGA 700 

AGGTGGCCGT CGCAGTGGCT TCGGTGTCAA CTATCGTCCC CAGGCTTACA 7 50 

30 TTCGCACTGC TGGTAAGTTT TCAAAACATT CGACCTCTCG CCTTAGAAGA 800 

AGAATACCTC TAACTTGTAT TTAGACGAGG CTTGCGACCT TTCTTTCCCC 850 

GATGGCGACA TGAGCCGCCG TGTCATGCCT GGTGACAACG TGGAAATGAT 900 

CCTCAACCTC AACAACCCTG TCG 923 

35 

2) INFORMATION FOR SEQ ID NO: 1876 

(i) SEQUENCE CHARACTERISTICS: 
iO (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

i5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24 670 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1876 

CATCATTGTT GTGTCCGCCA CTGATGGTCA AATGCCTCAA ACTCGTGAGC 50 
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ATTTGTTGTT GGCTCGCCAA 
AACAAGGTTG ATGCCGTTGA 
GGAAATGCGT GATTTGCTTT 
CCATTATCCA GGGTTCCGCT 
5 ATCGGCCGTA ACGCCATCAT 
CCCCACCCCT GCCCGTGATT 
ATGTGTTCTC CATCTCTGGC 
CGTGGTATGG TCACCAAGGG 
GCACTTCAAG ACCACCTTGA 

LO ACAAGGGTAT GGCTGGTGAC 
CGTGAGCAAG TCCGTCGTGG 
GCCACATAAG AAGTTCATGG 
GAGGCCGCCA CACTCCCTTC 
AGAACTGTTG ATGTTACCGC 

L5 TGCTGATGAG AAGATGGTCA 
AGCTCTT 



GTTGGTGTCC AACACTTGGT 
TGATCCTGAA ATGTTGGAGT 
CCCAATACGG TTTCCCCGGA 
CTCTGCGCTC TTGAGGACCG 
GAAGTTGATG GAGGCTGTTG 
TGGACAAACC TTTCCTCATG 
CGTGGTACTG TTGCCACTGG 
TACTGAAGTT GAAATCGTCG 
CCGGTATTGA AATGTTCCAC 
AACATGGGTT GCTTGCTTCG 
TATGGTTATC TGTGCCCCCG 
CTCAGCTCTA CATTCTCACC 
GTCAACAACT ACCGCCCACA 
CATCCTTAAG CACCCCCCTG 
TGCCCGGAGA CAACGTTCAA 



TGTCTTCATC 100 

TGGTCGAGAT 150 

GACAACGTCC 200 

CAACCCCGAG 250 

ATAGCAGCAT 300 

CCCGTTGAAG 350 

ACGTGTTGAG 4 00 

GTATGGGCGA 450 

AAGGAATTGG 500 

TGGTGTCAAG 550 

GATCCGTCAA 600 

AAGGATGAGG 650 

AATGTTCTTC 700 

GTACCCCCGA 750 

CTCGAGTGCG 800 

807 



20 2) INFORMATION FOR SEQ ID NO: 1877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

2 5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

)0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 

(B) STRAIN: ATCC 33236 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1877 



55 



AGTTGTTTCT GCTGCGGATG GTCCTATGCC TCAAACTCGC GAGCATATCT 50 

TGCTTTCTCG TCAAGTAGGC GTTCCATACA TCGTAGTTTT CCTAAACAAA 100 

ACCGATATGG TCGATGATCC GGATCTTTTA GAGTTAGTTG AAGAGGAAGT 150 

TAGAGATCTT TTAAAAGAGT ATAAATTCCC TGGCGACGAA ACCCCAATCA 200 

10 TTAAGGGTTC TGCTCTTAAG GCTCTTGAGG AAGCTAAGGC CGGACAAGAC 250 

GGCGAATGGT CTGCAAAGAT TATGGAGCTT ATGGACGCGG TTGATAGCTA 300 

TATTCCAACT CCTGTTCGCG ATACTGATAA AGATTTCCTT CTTCCGATCG 350 

AAGATATTTT CTCGATTTCC GGTCGCGGTA CCGTTGTAAC CGGTAGAATC 400 

GAAAAAGGTA TCGTTAAAGT TGGTGATACT ATCGAGATCG TAGGTATTAA 450 

15 ACCTACTCAG ACTACTACCG TCACTGGCGT TGAGATGTTT AGAAAAGAGA 500 

TGGATCAAGG TGAAGCCGGC GATAATGTAG GTGTTTTATT GCGCGGTACT 550 

AAGAAAGAGG AAGTAGAGCG CGGTATGGTT TTATGCAAAC CAAAATCGAT 600 

CACTCCTCAT ACTAAATTTG AGGGCGAGGT TTATATCCTA ACTAAAGAAG 650 

AAGGCGGACG CCATACTCCA TTCTTTAATA ATTATAGACC GCAGTTTTAC 700 

50 GTTCGTACGA CAGATGTTAC CGGTTCGATT ACTCTTCCTG AAGGAACCGA 750 

GATGGTTATG CCGGGCGACA ACGTTAAAAT CACCGTTGAG CTAATCGCTC 800 

CGATCG 806 
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2) INFORMATION FOR SEQ ID NO: 1878 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33292 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1878 

GCTGCAGATG GCCCTATGCC ACAAACTAGA GAGCACATTC TTCTTTCTCG 50 

TCAAGTAGGC GTTCCATATA TTGTTGTTTT TATGAATAAA GCAGATATGG 100 

TTGATGATGC TGAACTTTTA GAGTTAGTTG AAATGGAAAT TAGAGAATTA 150 

20 TTAAGCTCTT ATGATTTCCC AGGCGATGAT ACACCTATTA TTTCTGGTTC 200 

TGCTTTAAAA GCTCTTGAAG TUVGCTAAAGC TGGACAAGAT GGTGAATGGT 250 

CAGCAAAAAT TATGGATCTT ATGGCTGCAG TTGATAGCTA TATTCCAACT 300 

CC7\ACTCGTG ATACTGAAAA AGACTTCTTG ATGCCAATTG AAGATGTTTT 350 

CTCAATTTCA GGTCGTGGTA CTGTTGTTAC AGGTAGAATT GAAAAAGGTG 400 

25 TTGTAAAAGT AGGTGATACT ATCGAAATCG TTGGTATTAA AGATACTCAA 4 50 

ACAACAACTG TAACAGGTGT TGAAATGTTC AGAAAAGAAA TGGATCAAGG 500 

CGAAGCAGGA GATAACGTAG GTGTTCTTCT TCGTGGTACT ATUUVAAGAAG 550 

AAGTTATCCG TGGTATGGTT CTTGCTAAAC CAAAATCAAT TACTCCACAC 600 

ACTGACTTCG AAGCTGAAGT TTATATCTTA AATAAAGATG AAGGTGGTAG 650 

30 ACATACTCCA TTCTTTAACA ACTATAGACC ACAGTTTTAT GTAAGAACAA 700 

CTGATGTTAC AGGTTCGATT AAATTAGCTG ATGGTGTTGA AATGGTTATG 750 

CCAGGTGAAA ATGTGAGAAT TACTGTAAGC TTGATCGCTC CAGTAGCACT 800 

TGAAGA 806 



2) INFORMATION FOR SEQ ID NO: 1879 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA~222 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1879 

ATGTATGCAA CCGAGAGCAC TCCCGGATCT TGGTTTAAAT GGCACTAATA 50 
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TAAGACAGGC CTCAAACTCG AGAGCATTTA CTTCTCGCCC GTCAGATCGG 100 

TATCCAAAAA ATCGTCGTCT TCGTGAACAA GGTTGATGCC ATCGAGGACA 150 

AAGAGATGTT GGAGCTTGTT GAATTGGAGA TGCGTGAACT CCTAACCAGC 200 

TACGGTTTCS AGGGTGAAGA AACTCCCATC ATTTTTGGCT CTGCTCTCTG 250 

5 TGCCCTCSAA GGAAGACAAC CCGAGATCGG TGTTACCAAG ATTGATGAGC 300 

TCTTGCAGGC CGTCGACACC TGGATTCCCA CTCCTCAGCG TGAGACTGAC 350 

AAGCCCTTCT TGATGTCCAT TGAGGAAGTG TTCTCTATTT CCGGACGAGG 400 

AACCGTTGTC TCCGGCCGTG TGGAGCGTGG TATCCTCAAG AAGGACTCCG 450 

AAGTTGAAAT TGTCGGCGGT TCGCCCGAGC CAATCAAAAC CAAGGTTACC 500 

LO GATATCGAAA CCTTTAAGAA GTCTTGCGAC GAGTCTCGCG CTGGTGATAA 550 

CTCCGGCTTG CTCCTACGAG GCGTTAAGCG TGAAGATATT AGCCGTGGCA 600 

TGGTCGTCGC TGTACCAGGA AGTGTCAAGG CCCATACTGA ATTCTTAGTT 650 

TCGCTTTACG TCCTCACCGA AGCTGAGGGT GGGCGCAAAT CTGGATTCAG 7 00 

CAGC7\AGTAC CGCCCACAGA TGTTCATTCG CACTGCCGGT ATGTAATACT 750 

L5 GTGATAATTT CGTTGACATG GTACTGATTG AATTCTATAG ACGAAGCGGC 800 

TCAGCTCAGC TGGCCCGGAG AAGATCAAGA CAAGATGGCT ATGCCAGGAG 8 50 

ACAATATCGA AATGATTTGC ACCACCTTGC ACCCAGTTGC CGCCGA 8 96 



20 



2) INFORMATION FOR SEQ ID NO: 1880 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylo^rora 

(B) STRAIN: ATCC 14976 

IS (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1880 

CTGGTAGTTG CTGCGACTGA CGGCCCAATG CCTCAGACYC GTGAGCACAT 50 

CCTGCTGGGT CGCCAGGTTG GCGTGCCATA CATCATCGTG TTCCTGAACA 100 

AATGTGACAT GGTTGATGAT GAAGAGCTGC TGGAGCTGGT TGAAATGGAA 150 

10 GTMCGTGACC TGCTGTCACA GTACGACTTC CCAGGCGACG ACACGCCAAT 200 

CGTGCRYGGT TCTGCGCTGA AAGCGCTGGA RGGCGAAGCA GAGTGGGAAG 250 

CGAAGATCAT CGAACTGGCT GGCCATCTGG ATAACTACAT CCCGGAACCA 300 

GAGCGTGCGA TTGACAAACC GTTCCTGCTG CCAATTGAAG ACGTGTTCTC 350 

CATCTCTGGC CGTGGTACCG TTGTTACCGG TCGTGTAGAG CGCGGTRTSG 400 

15 TTAAAGTGGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA TACCGTGAAA 4 50 

TCAACCTGTA CCGGCGTTGA GATGTTCCGT AAGCTGCTGG ACGAAGGCCG 500 

TGCGGGTGAG AACTGTGGTA TCCTGCTGCG CGGTATCAAG CGCGAAGATA 550 

TCCAGCGTGG TCAGGTTCTG GCG/^GCCAG GCACCATCAA GCCACACACC 600 

AAGTTCGAGT CAGAAGTTTA TATTCTGTCT AAAGACGAAG GCGGCCGTCA 650 

50 TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC CGTACTACTG 700 

ACGTGACCGG GACTATCGAA CTGCCAGAAG GCGTTGAGAT GGTGATGCCA 750 

GGCGACAACA TTCAGATGGT TGTGACCCTG ATCCACCCGA TCGCCATG 7 98 

987 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



WO 01/23604 PCT/CAOO/01150 
2) INFORMATION FOR SEQ ID NO: 1881 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1881 



ATCCTGGTTG TTGCTGCGAC TGACGGYCCG ATGCCGCAGA CCCGTGAGCA 50 

CATCCTGCTG GGTCGTCAGG TAGGCGTTCC GTACATCATC GTGTTCCTGA 100 

20 ACAAATGCGA CATGGTTGAT GACGAAGAGC TGCTGGAACT GGTTGAAATG 150 

GAAGTTCGYG AACTGCTGTC TCAGTACGAC TTCCCGGGCG ACGACACTCC 200 

GATCGTTCGT GGTTCTGCTC TGAAAGCGCT GGAAGGCGAC GCAGAGTGGG 250 

AAGCGAAAAT CATCGAACTG GCTGGCTTCC TGGATTCTTA CATYCCGGAA 300 

CCAGAGCGTG CGATTGACAA GCCGTTCCTG CTGCCGATCG AAGACGTATT 350 

25 CTCCATCTCC GGTCGTGGTA CCGTTGTTAC CGGTCGTGTA GARCGCGGTA 400 

TCATC7\AAGT GGGCGAAGAA GTTGAAATCG TTGGTATCAA AGAGACTCAG 450 

AAGTCTACCT GTACTGGCGT TGAAATGTTC CGCAAACTGC TGGACGAAGG 500 

CCGTGCCGGT GAGAACGTAG GTGTTCTGCT GCGTGGTATC AAACGTGAAG 550 

AAATCGAACG TGGTCAGGTA CTGGCTAAGC CGGGCACCAT CAAGCCGCAC 600 

30 ACCAAGTTCG AATCTGAAGT GTACATTCTG TCCAAAGATG AAGGCGGCCG 650 

TCATACTCCG TTCTTCAAAG GCTACCGTCC GCAGTTCTAC TTCCGTACTA 700 

CTGACGTGAC TGGCACCATC GAACTGCCGG AAGGCGTAGA GATGGTAATG 750 

CCGGGCGACA ACATCAAAAT GGTTGTTACC CTGATCCACC CGATCGCGAT 800 

GGACGACGGT 810 

35 



2) INFORMATION FOR SEQ ID NO: 1882 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
50 (B) STRAIN: BM10711 

(C) ACCESSION NUMBER : AF015628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1882 
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ATGAATTTTT 
ATACGGTATA 
AGGCAAACAA 
GAAGTTCCTA 
TGGAGACATA 
CAAAAAAAGG 
CCTTACGGAA 
GAATGGAGAT 
ATGATTTGCC 
GATAACGCAC 
TACAAATACA 
CTCCAGTGGG 
ATTATGGGCA 
AT AT GAT ATT 
AAAATAGCGA 
ATTACAAACG 
GGAACCTCAT 
TAAAATGTAA 



ATTTAGAGGA 
ACTTCATCAG 
AATCAGCAGT 
CCCCTGATGC 
TGGTTTACAG 
TGGCTTTACA 
TAACGGAAGG 
CGTATAGGAA 
AAATAAGGGA 
TTTGGTTCAC 
GGGAAATTAG 
TATCACTAGT 
ACAAAATAGG 
CCAACTCCAA 
AATATGGTTT 
ACAAAACAAT 
GGTATTACCT 
AATTGGGAAG 



GTTTAACTTG 
AAGACGGAAA 
CTAGATCAGA 
TAAAGTGATG 
AGAATGGTGC 
GAATATCCAT 
TCTAAATGGC 
AGTTGACAGC 
TCTTATCCTG 
GGAGAACCAA 
AAGAATATCC 
GGTAACGATG 
TCGAATCACT 
ACGCACGTCC 
ACTGAATGGG 
TCAAGAATAT 
TTGGAAAAGA 
CTGAATTTGA 



TCTATTCCCG 
GGTATGGTTC 
GTGGTAGGAT 
TGTTTAATTG 
AAATAAAATC 
TGCCACAGCC 
GATATATGGT 
TGATGGGACT 
CTTTTATTAC 
AATAATTCTA 
TCTACCAACA 
GTGCACTCTG 
ACAACTGGTG 
ACACGCTATA 
GGGCAAATCA 
CAACTTCAAA 
TGGATCCGTA 
ACGAATGA 



ATTCAGGTCC 50 

ACACAACATA 100 

AAAAGAATTC 150 

TATCTTCACT 200 

GGAAAGCTCT 250 

GGATTCTGGT 300 

TTACCCAATT 350 

ATTTATGAAT 4 00 

TTTAGGTTCG 450 

TTGGAAGGAT 500 

AATGCAGCGG 550 

GTTTGTCGAA 600 

AGATTAGCGA 650 

ACCGCGGGGA 700 

AATCGGCAGA 7 50 

CAGAAAATGC 800 

TGGTTTGCAT 850 

888 



20 



2)INF0£^TI0N FOR SEQ ID NO: 1883 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1883 
35 AGCCGCTTGA GCAAATTAAA CTA 23 



40 



2) INFORMATION FOR SEQ ID NO: 1884 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1884 

50 

GTATCCCGCA GATAAATCAC CAC 23 
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2) INFORMATION FOR SEQ ID NO: 1885 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1885 
AGCGAAAAAC ACCTTGCCGA C 21 

2) INFORMATION FOR SEQ ID NO: 1886 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1886 
GACGCCCGCG CCACCACT 18 

30 



2) INFORMATION FOR SEQ ID NO: 1887 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 87 
45 GACGCCCGCG ACACCACTA 



2) INFORMATION FOR SEQ ID NO: 1888 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1888 
GACGCCCGCA ACACCACTA 19 

2) INFORMATION FOR SEQ ID NO: 1889 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1889 
GTTCGCAACT GCAGCTGCTG 20 

25 



2) INFORMATION FOR SEQ ID NO: 1890 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1890 
40 TTCGCAACGG CAGCTGCTG 19 



45 



2) INFORMATION FOR SEQ ID NO: 1891 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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18 



2) INFORMATION FOR SEQ ID NO: 1892 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1892 
CGGAGCTGCC AARCGGGG 18 

20 

2) INFORMATION FOR SEQ ID NO: 1893 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1893 
35 GGAGCTGGCG ARCGGGGT 18 



40 



2) INFOEIMATION FOR SEQ ID NO: 1894 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1894 

50 

GACCGGAGCT AGCGARCG 18 
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2) INFORMATION FOR SEQ ID NO: 1895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1895 
CGGAGCTAGC AARCGGGGT 



2) INFORMATION FOR SEQ ID NO: 1896 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1896 
GAAACGGAAC TGAATGAGGC G 



2) INFORMATION FOR SEQ ID NO: 1897 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1897 
CATTACCATG GGCGATAACA G 



2) INFORMATION FOR SEQ ID NO: 1898 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1898 
CCATTACCAT GAGCGATAAC AG 22 

10 



2) INFORMATION FOR SEQ ID NO: 1899 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 
25 (B) STRAIN: 15571 

(C) ACCESSION NUMBER: AF124984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1899 

30 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

35 ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

40 GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 7 50 

45 GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA A 861 



50 

2) INFORMATION FOR SEQ ID NO: 1900 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 
10 (B) STRAIN: SLK-47 

(C) ACCESSION NUMBER: Y11069 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1900 

15 CTGTTAGCCA CCCTGCCGCT GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 50 

GCAAATTAAA CTAAGCGAAA GCCAGCTGTC GGGCCGCGTA GGCATGATAG 100 

AAATGGATCT GGCCAGCGGC CGCACGCTGA CCGCCTGGCG CGCCGATGAA 150 

CGCTTTCCCA TGATGAGCAC CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 200 

GGCGCGGGTG GATGCCGGTG ACGAACAGCT GGAGCGAAAG ATCCACTATC 250 

20 GCCAGCAGGA TCTGGTGGAC TACTCGCCGG TCAGCGAAAA ACACCTTGCC 300 

GACGGCATGA CGGTCGGCGA ACTCTGCGCC GCCGCCATTA CCATGAGCGA 350 

TAACAGCGCC GCCAATCTGC TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 400 

TGACTGCCTT TTTGCGCCAG ATCGGCGACA ACGTCACCCG CCTTGACCGC 450 

TGGGAJ^CGG AACTGAATGA GGCGCTTCCC GGCGACGCCC GCGCCACCAC 500 

25 TACCCCGGCC AGCATGGCCG CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 550 

GTCTGAGCGC CCGTTCGCAA CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 600 

CGGGTCGCCG GACCGTTGAT CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 650 

CGCCGATAAG ACCGGAGCTG GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC 700 

TGCTTGGCCC GAATAACAAA GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 750 

30 GATACCCCGG CGAGCATGGC CGAGCGAAAT 780 



2) INFORMATION FOR SEQ ID NO: 1901 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: U92041 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1901 

50 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 
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CTTTAAAGTA 
ACGAACAGCT 
TACTCGCCGG 
ACTCTGCGCC 
TACTGGCCAC 
ATCGGCGACA 
GGCGCTTCCC 
CGACCCTGCG 
CGGCAGCTGC 
CCGCTCCGTG 
GCGAGCGGGG 
GCAGAGCGCA 
CGAGCGAAAT 
GGCAACGCTA 



GTGCTCTGCG 
GGAGCGAAAG 
TCAGCGAAAA 
GCCGCCATTA 
CGTCGGCGGC 
ACGTCACCCG 
GGCGACGCCC 
CAAGCTGCTG 
TGCAGTGGAT 
CTGCCGGCGG 
TGCGCGCGGG 
TTGTGGTGAT 
CAGCAAATCG 
A 



GCGCAGTGCT 
ATCCACTATC 
ACACCTTGCC 
CCATGAGCGA 
CCCGCAGGAT 
CCTTGACCGC 
GCAACACCAC 
ACCAGCCAGC 
GGTGGACGAT 
GCTGGTTTAT 
ATTGTCGCCC 
TTATCTGCGG 
CCGGGATCGG 



GGCGCGGGTG 
GCCAGCAGGA 
GACGGCATGA 
TAACAGCGCC 
TGACTGCCTT 
TGGGAAACGG 
TACCCCGGCC 
GTCTGAGCGC 
CGGGTCGCCG 
CGCCGATAAG 
TGCTTGGCCC 
GATACCCCGG 
CGCGGCGCTG 



GATGCCGGTG 
TCTGGTGGAC 
CGGTCGGCGA 
GCCAATCTGC 
TTTGCGCCAG 
AACTGT^TGA 
AGCATGGCCG 
CCGTTCGCAA 
GACCGTTGAT 
ACCGGAGCTG 
GAATAACAAA 
CGAGCATGGC 
ATCGAGCACT 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
861 



2 ) INFORMATION FOR SEQ ID NO: 1902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STEIANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: 803 

(C) ACCESSION NUMBER: AF164577 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1902 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CAAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA A 861 
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2) INFORMATION FOR SEQ ID NO: 1903 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



5 



(ii) 



MOLECULE TYPE: Genomic DNA 



10 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 700603 

(C) ACCESSION NUMBER: AF132290 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1903 

ATGCGTTATT TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 
GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 

20 GCCAGCTGTC GGGCAGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 
CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 
CTTTATU^GTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 
ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 
TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 

25 ACTCTGTGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 
TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 
ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 
GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 
CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 

30 CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 
CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 
CCAAACGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 
GCAGAGCGGA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 
CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 

35 GGCAACGCTA ACCCGGCGGT GGCCGCGCGC GTTATCCGGC TCGTAG 



2) INFORMATION FOR SEQ ID NO: 1904 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
896 



40 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: JC2926 

(C) ACCESSION NUMBER: AF148851 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1904 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

5 GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

10 ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 4 00 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 5 50 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

15 CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTA 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 7 50 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

20 GGCAACGCTA A 8 61 



2) INFORMATION FOR SEQ ID NO: 1905 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 . (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Pseudomonas aeruginosa 

(C) ACCESSION NUMBER: AF096930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1905 

40 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

45 ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTAC 4 00 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGG7\AACGG AACTGAATGA 500 

50 GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTA 700 
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GCAAACGGGG TGCGCGCGGG ATTGTCGCCC 
GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 
CGAGCGAAAT CAGCAAATCG CCGGGATCGG 
GGCAACGCTA A 



PCT/CAOO/01150 

TGCTTGGCCC GAATAACAAA 750 
GATACGCCGG CGAGCATGGC 800 
CGCGGCGCTG ATCGAGCACT 850 

861 



2) INFORMATION FOR SEQ ID NO: 1906 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1906 
20 CCTTATTCCC TTTTTTGCGG 20 



25 



2) INFORMATION FOR SEQ ID NO: 1907 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1907 

35 

CACCTATCTC AGCGATCTGT CT 22 



iO 2) INFORMATION FOR SEQ ID NO: 1908 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1908 

AACAGCGGTA. AGATCCTTGA GAG 23 
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2) INFORMATION FOR SEQ ID NO: 1909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1909 
ATGACTTGGT TAAGTACTCA CC 



2) INFORMATION FOR SEQ ID NO: 1910 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1910 
ATGACTTGGT TGAGTACTCA CC 



2) INFORMATION FOR SEQ ID NO: 1911 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1911 
CCATAACCAT GGGTGATAAC AC 



2) INFORMATION FOR SEQ ID NO: 1912 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

1000 

SUBSTITUTE SHEET (RULE 26) 



10 



45 



WO 01/23604 PCT/CAOO/01150 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1912 
CCATAACCAT GAGTGATAAC AC 22 



2) INFORMATION FOR SEQ ID NO: 1913 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1913 
CGCCTTGATC ATTGGGAACC 20 

25 

2) INFORMATION FOR SEQ ID NO: 1914 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1914 
40 CGCCTTGATC GTTGGGAACC 20 



2) INFORMATION FOR SEQ ID NO: 1915 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1915 

CGCCTTGATA GTTGGGAACC 20 

5 

2) INFORMATION FOR SEQ ID NO: 1916 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1916 
CGTGGGTCTT GCGGTATCAT 20 

20 

2) INFORMATION FOR SEQ ID NO: 1917 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1917 
35 CGTGGGTCTG GCGGTATCAT 20 



40 



2) INFORMATION FOR SEQ ID NO: 1918 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1918 

50 

GTGGGTCTCA CGGTATCATT G 21 
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15 



2) INFORMATION FOR SEQ ID NO: 1919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1919 
CGTGGGTCTC TCGGTATCAT T 21 



2) INFORMATION FOR SEQ ID NO: 1920 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1920 
CGTGGNTCTC GCGGTATCAT '20 

30 



2) INFORMATION FOR SEQ ID NO: 1921 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

iO 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1921 
45 CGTGGGTCTA GCGGTATCAT T 21 



2) INFORMATION FOR SEQ ID NO: 1922 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

1003 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1922 
GTTTTCCAAT GATTAGCACT TTTA 24 



2) INFORMATION FOR SEQ ID NO: 1923 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1923 
GTTTTCCAAT GATAAGCACT TTTA 24 

25 

2) INFORMATION FOR SEQ ID NO: 1924 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1924 
40 GTTTTCCAAT GCTGAGCACT TTT 23 



45 



2) INFORMATION FOR SEQ ID NO: 1925 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1925 



CGTTTTCCAA TGATGAGCAC TTT 23 



2) INFORMATION FOR SEQ ID NO: 1926 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single • 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1926 
GTTTTCCAAT GGTGAGCACT TTT 23 

20 



2) INFORMATION FOR SEQ ID NO: 1927 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 
35 (B) STRAIN: MC9690-129 

(C) ACCESSION NUMBER: AF126482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1927 

40 ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

45 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

50 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 
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GAGCCGGTGA GCGTGGGTCT CGCGGTATCA 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA 
AGCATTGGTA A 
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TTGCAGCACT GGGGCCAGAT 750 
ACGACGGGGA GTCAGGCAAC 800 
GATAGGTGCC TCACTGATTA 850 

861 



2) INFORMATION FOR SEQ ID NO: 1928 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
20 (B) STRAIN: HB251 

(C) ACCESSION NUMBER: X57 972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1928 

25 ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

30 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCATTGGG AACCGGAGCT 500 

35 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCT^ 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 7 00 

GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 7 50 

40 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



i5 

2) INFORMATION FOR SEQ ID NO: 1929 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 808 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
5 (B) STRAIN: 26W 

(C) ACCESSION NUMBER: U09188 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1929 

10 ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

15 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 4 50 

CACAACATGG GGGATCATGT AACCCGCCTT GATAGTTGGG AACCGGAGCT 500 

20 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

25 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATG 808 



30 2) INFORMATION FOR SEQ ID NO: 1930 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: AF190692 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1930 



ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

50 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 
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ACTTACTTCT GACAACGATC GGAGGACCGA 

CACAACATGG GGGATCATGT AACTCGCCTT 

GAATGAAGCC ATACCAAACG ACGAGCGTGA 

TGGCAACAAC GTTGCGCAAA CTATTAACTG 

5 TCCCGGCAAC AATTAATAGA CTGGATGGAG 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 

GAGCCGGTGA GCGTGGGTCT GGCGGTATCA 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC 

TATGGATGAA CGAAATAGAC AGATCGCTGA 
10 AGCATTGGTA A 
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AGGAGCTAAC CGCTTTTTTG 450 

GATCGTTGGG AACCGGAGCT 500 

CACCACGATG CCTGCAGCAA 550 

GCGAACTACT TACTCTAGCT 600 

GCGGATAAAG TTGCAGGACC 650 

GTTTATTGCT GATAAATCTG 700 

TTGCAGCACT GGGGCCAGAT 7 50 

ACGACGGGGA GTCAGGCAAC 800 

GATAGGTGCC TCACTGATTA 850 

861 



2) INFORMATION FOR SEQ ID NO: 1931 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2728 

(C) ACCESSION NUMBER: AF104442 



30 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1931 



ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

35 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AG7\AAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 4 50 

40 CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCTy^C AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 7 00 

45 GAGCCGGTGA GCGTGGGTCT CTCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 8 61 



2) INFORMATION FOR SEQ ID NO: 1932 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: AF190695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1932 

15 ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGCT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

20 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

25 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT AGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

30 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1933 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 861 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AF190693 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1933 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 
TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 
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CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 
AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC 
GAGCACTTTT AAAGTTCTGC TATGTGGTGC 
CCGGGCAAGA GCAACTCGGT CGCCGCATAC 
5 GTTGAGTACT CACCAGTCAC AGAAAAGCAT 
;\AGAGAATTA TGCAGTGCTG CCATAACCAT 
ACTTACTTCT GACAACGATC GGAGGACCGA 
CACAACATGG GGGATCATGT AACCCGCCTT 
GAATGAAGCC ATACCAAACG ACGAGCGTGA 

10 TGGCAACAAC GTTGCGCAAA CTATTAACTG 
TCCCGGCAAC AATTAATAGA CTGGATGGAG 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 
GAGCCGGTGA GCGTGGATCT CGCGGTATCA 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC 

15 TATGGATGAA CGAGATAGAC AGATCGCTGA 
AGCATTGGTA A 
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ACATCGAGCT GGATCTCAAC 150 

GAAGAACGTT TTCCAATGGT 200 

GGTATTATCC CGTGTTGACG 250 

ACTATTCTCA GAATGACTTG 300 

CTTACGGATG GCATGACAGT 350 

GAGTGATAAC ACTGCGGCCA 4 00 

AGGAGCTAAC CGCTTTTTTG 450 

GATCGTCGGG AACCGGAGCT 500 

CACCACGATG CCTGCAGCAA 550 

GCGAACTACT TACTCTAGCT 600 

GCGGATAAAG TTGCAGGACC 650 

GTTTATTGCT GATAAATCTG 700 

TTGCAGCACT GGGGCCAGAT 750 

ACGACGGGGA GTCAGGCAAC 800 

GATAGGTGCC TCACTGATTA 850 

861 



20 2) INFORMATION FOR SEQ ID NO: 1934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1934 

GAACGCCAGC GCGAAATTCA AAAAG 25 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1935 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1935 
AGCTCGGCAT ACTTCGACAG G 21 

2) INFORMATION FOR SEQ ID NO: 1936 
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10 



30 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY-: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1936 
TACCACCCGC ACGGC 15 



15 2) INFORMATION FOR SEQ ID NO: 1937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1937 

CGGAGTCGCC GTCGATG 17 



2) INFORMATION FOR SEQ ID NO: 1938 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1938 
CCGCGCACCA TTGCTTCGTA CACTGAGGAG TCTCCGCGCG G 41 

2) INFORMATION FOR SEQ ID NO: 1939 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 46 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1939 
CGACCCGGAT GGTAGTATCG ATAATGATCC GCCAGCGGCC GGGTCG 



2) INFORMATION FOR SEQ ID NO: 1940 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1940 
GTATCGTTGG TGACGTAAT 



2) INFORMATION FOR SEQ ID NO: 1941 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1941 
GCAATGGTCC GTTTAAGT 



2) INFORMATION FOR SEQ ID NO: 1942 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1942 
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2) INFORMATION FOR SEQ ID NO: 1943 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1943 
GCTGGTGGAC GGCCAG 



2) INFORMATION FOR SEQ ID NO: 1944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1944 
TTTCGCCGCC ATGCGTTAC 



2) INFORMATION FOR SEQ ID NO: 1945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1945 
CGGCGACTAC GCGGTAT 



2) INFORMATION FOR SEQ ID NO: 1946 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1946 
CGGCGACTTC GCGGTAT 



2) INFORMATION FOR SEQ ID NO: 1947 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1947 
CGGTATACGG CACCATCGT 



2) INFORMATION FOR SEQ ID NO: 1948 

(i) SEQUENCE CHAilACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1948 
GCGGTATACA ACACCATCG 



2) INFORMATION FOR SEQ ID NO: 1949 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

CGGTATACGC CACCATCGT 



2) INFORMATION FOR SEQ ID NO: 1950 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1950 
GGCGACATCG CCTGC 



2) INFORMATION FOR SEQ ID NO: 1951 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1951 
GGCGACAGAG CCTGCTA 



2) INFORMATION FOR SEQ ID NO: 1952 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1952 

1015 
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CCTGCTATGG AGCGATGGT 19 



2) INFORMATION FOR SEQ ID NO: 1953 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
LO (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



L5 



>0 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1953 
CGCCTGCTAT AAAGCGATGG T 21 

2) INFORMATION FOR SEQ ID NO: 1954 



(i) SEQUENCE CHARACTERISTICS: 
>5 (A) LENGTH: 589 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

55 (C) ACCESSION NUMBER: AF052258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1954 

ACACCGGTCA ACATTGAGGA AGAGCTTAAG AACTCTTATC TGGATTATGC 50 

10 GATGTCGGTC ATTGTTGGCC GTGCGCTGCC GGATGTCCGA GATGGCCTGA 100 

AGCCGGTACA CCGTCGCGTA CTTTACGCCA TGAACGTATT GGGCAATGAC 150 

TGGAACAAAG CCTATAAAAA ATCAGCCCGT GTCGTTGGTG ACGTAATCGG 200 

TAAATACCAC CCGCACGGCG ACTCCGCGGT ATACGACACC ATCGTGCGTA 250 

TGGCGCAGCC GTTCTCGCTG CGTTACATGC TGGTGGACGG CCAGGGTAAC 300 

15 TTTGGTTCCA TCGACGGCGA CTCCGCCGCG GCGATGCGTT ATACCGAAAT 350 

TCGTCTGGCG AAAATCGCTC ATGAGCTGAT GGCCGATCTT GAAAAAGAGA 400 

CGGTCGATTT CGTCGACAAC TATGACGGTA CGGAGCGTAT TCCGGACGTC 450 

ATGCCGACCA AAATTCCTAA CCTGCTGGTG AACGGCGCCT CCGGGATCGC 500 

CGTAGGGATG GCCACCAACA TACCGCCACA TAACCTGACG GAAGTGATTA 550 

50 ACGGCTGTCT GGCGTATGTT GACGATGAAG ACATCAGCA 589 
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2) INFORMATION FOR SEQ ID NO: 1955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1955 
CGCGCAACCA TTGCTTCGTA CACTGAGGAG TCTGCGCG 



2) INFORMATION FOR SEQ ID NO: 1956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Candida inconspicua 

(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1956 

CGGTAAAACC ACTTTT^CTG CTGCAATCAC CAAAGTTTTA TCTGAAGAAG 50 

GTGGTGCAGA TTTCTTAGAT TATTCATCTA TTGATAAAGC ACCAGAAGAG 100 

AGAGCTAGAG GTATTACCAT TTCTACTGCT CATGTTGAAT ATGAAACTCC 150 

AAACAGACAT TATTCACATG TTGATTGTCC AGGTCATCAA GATTATATTA 200 

AGAACATGAT TACTGGTGCA GCTCAAATGG ATGGTGCTAT TATTGTTGTT 250 

GCAGCTACTG ATGGTCAAAT GCCACAAACT AAGGAACATT TATTATTAGC 3 00 

TAGACAAGTT GGTGTTCAAC ATTTAGTTGT TTTTGTTAAC AAGTGTGATA 3 50 

CCGTTGATGA TCCAGAAATG TTRGAATTAG TTGAAATGGA AATGAGAGAA 4 00 

TTATTGACCG 7VATATGGATT YGATGGTGAT AACACACCAG TTATTATGGG 450 

TTCTGCATTA ATGGCATTAG AAGGTAAGAG ACCAGAAGTT GGTAAAGAAT 500 

CTATTGTTAA GTTAATGGAA GCTGTTGATA CTTGGATTCC AACTCCACAA 550 

AGAGACTTAG AAAAACCATT CTTATTACCA ATTGATGAAG TTTTCTCTAT 600 

TTCTGGTAGA GGTACTGTTG TTTCTGGTAC TGTTGATAGA GGTACATTAA 650 

AGAAGGGTGA AGAAGTTGAA ATTGTTGGTG GTAAAGAAGG TGTTATTAAG 700 

ACTACTGTTA CCGGTATTGA AATGTATCAC AAGGAATTAG ATCAAGCACA 750 

AGCTGGTGAT ACTCCAGGTA TTTTGTTAAG AGGTGTTAAG AGAGATCAAA 800 

TTGCAAGAGG TCAAATTCTT GCAAAGCCAG GWTCTGTTAA GGCATACAAG 850 

AA6TTCTTAT CATCATTATA CATTTTAACA AAGGAAGAAG GTGGTAGACA 900 

TACTCCATTT TCTGAAAATT ACAGACCTCA AATGTACATT AGAACTTCCA 950 

ATGTTAATGT TACTTTGAAG TTCCCAGAAA CTGAAGAAG 989 
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2) INFORMATION FOR SEQ ID NO: 1957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

{ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: ATCC 22023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1957 

GGTAAGACCA CCCTTACTGC CGCCATCACC AAGTGCCTTG CTGAGAAGGG 50 

AGGTGCCTCG TTCTTGGACT ACAGTGCCAT CGACAAGGCG CCAGAGGAGA 100 

GAGCAAGAGG TATCACCATC TCCACTGCGC ACGTTGAGTA TGAAACTGCC 150 

2 0 AACAGACACT ACTCGCACGT TGACTGTCCA GGTCACGCTG ATTACATCAA 200 

GAACATGATT ACCGGTGCTG CGCAGATGGA CGGTGCTATC ATTGTCGTTG 250 

CAGCCACTGA CGGTCAGATG CCACAGACCA GAGAACACTT GTTGCTTGCC 300 

AGACAAGTTG GTGTCCAGCA CATTGTTGTC TTCGTCAACA AGGTTGACAC 350 

CATCGACGAC CCTGAGATGC TTGAGCTTGT TGAAATGGAG ATGAGAGAGT 400 

25 TGCTTACTTC GTATGGATTT GACGGTGATA ACACCCCAGT TATCATGGGT 450 

TCTGCTTTGT GTGCTTTGGA AGGCCGTGAG CCAGAGATTG GTGCTAAGGC 500 

CATTGACAAG TTGATGGAGG CCATTGATGA GTACATCCCA ACTCCTCAGA 550 

GAGACCTGGA AAAGCCATTC YTGATGGGTG TTGAAGACGT GTTCTCGATC 600 

TCTGGTAGAG GTACCGTTGT CACRGGCCGT GTTGAGCGTG GTAACTTGAA 650 

30 GAAAGGTGAT GAAATTGAAC TTGTTGGCTA CAACAAGAAC CCAATCAAGA 700 

CCACCGTCAC CGGTATCGAA ATGTTCAAGA AGGAGTTGGA ATCTGCCATG 750 

GCTGGTGACA ACTGTGGTAT CTTGTTGCGT GGTATCAAGA GAGATGACGT 800 

CAAGAGAGGT ATGGTTGCTG CTAAGCCAGG CTCCGTCTCT GCACACACCA 850 

AGTTCCTCGC TTCCTTGTAC ATCCTGACRA AGGAGGAAGG TGGTCGTCAC 900 

35 AGTGCCTTTG CTGAGAACTA CAGACCACAG ATGTTCATCA GAACCGGAGA 950 

TGTCACCACC ATCTTGACAT GGCCAGAGGA GCACGCTGAC C 991 



40 2) INFORMATION FOR SEQ ID NO: 1958 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1958 

CGGTAAGACC ACTTTGACCG CCGCCATCAC CAAGGTGTTG AGCGCCAAAG 50 

GTGGTGCTTC CTTCTTGGAC TACGGGTCCA TCGACAGAGC CCCTGAGGAG 100 

AGAGCCAGAG GTATTACTAT CTCGACTGCC CACGTTGAGT ACGAGACCGA 150 

TAAGAGACAC TACGCCCACG TTGATTGCCC TGGTCACGCT GATTACATCA 2 00 

AGAACATGAT CACTGGTGCC GCCCAAATGG ACGGTGCCAT TATTGTCGTT 250 

GCTGCTTCTG ATGGCCAAAT GCCGCAGACC AGAGAGCACT TGTTGCTTGC 300 

CAGACAGGTT GGTGTGCAGA ACTTGGTTGT GTTTGTTAAC AAGGTGGACA 3 50 

CCATCGACGA CCCCGAAATG TTGGAGTTGG TGGAGATGGA AATGAGAGAA 4 00 

TTGTTGACCC ACTACGGCTT TGACGGTGAC AACACCCCTG TCATCATGGG 450 

TTCGGCGTTG TGTGCCTTGG AAGACAGGCA GCCTGAGATT GGCGAGCAAG 500 

CCATCATGAA GTTGTTGGAC GCTGTCGACG AGTACATTCC CACTCCTCAG 550 

AGAGACTTGG AGCAACCATT TTTGATGCCC GTTGAGGATG TTTTCTCCAT 600 

CTCTGGCAGA GGTACTGTTG TCACCGGTCG TGTTGAGAGA GGCTCATTGA 650 

AGAAGGGTGA GGAGATTGAG ATTGTTGGCG ACTTCCCCAA GCCCTTCAAG 700 

ACTACCGTCA CCGGCATTGA GATGTTCAAG AAGGAGTTGG ATGCCGCGAT 750 

GGCGGGCGAC AACGCCGGGA TCTTGTTGAG AGGTGTCAAG AGAGACGAGG 800 

TCTCGAGAGG TATGGTTTTG GCCAAGCCCG GTACTGTCAC TTCGCACACC 850 

AAGGTGTTGG CGTCGCTTTA CATCTTGACC AAAGAGGAAG GTGGCCGCCA 900 

CTCGCCCTTT GGTGAGAACT ACAAGCCACA GTTATTCATC AGAACCTCCG 950 

ATGTCACTGG TACTTTGAGG TTCCCCGCCG GTGAG 985 



2) INFORMATION FOR SEQ ID NO: 1959 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1959 

CGGTTVAGACC ACCTTGACTG CCGCCATCAC CAAGGTTCTC TCCGAGAAGG 50 

GTGGTGCCGA CTTCTTGGAC TACGGTGCCA TTGACAGAGC CCCCGAGGAG 100 

CGTGCCCGTG GTATCACCAT CTCCACTGCC CACGTTGAGT ACGAGACTGA 150 

CAACCGTCAC TACGCCCACA TTGACTGTCC CGGTCACGCT GATTACATCA 2 00 

AGAACATGAT TACCGGTGCC GCCCAGATGG ACGGTGCCAT TATTGTCGTT 250 

GCTGCTACTG ACGGTGCCAT GCCCCAGACC CGCGAGCACT TGCTTCTCGC 300 

CCGTCAGGTT GGTATCCAGG AATTGGTTGT GTTTGTGAAC AAGGTTGACA 350 

CCATCGACGA CCCCGAGATG TTGGAGCTCG TTGAGATGGA GATCCGCGAG 400 

TTGTTGTCTG AGTTCGGTTT TGACGGTGAC AACACCCCCG TCATCATGGG 4 50 

TTCCGCTTTG TGCGCTTTGG AGGGCAAGCA GCCCGAGATT GGTGAGCAGG 500 

CTATCACCAA GTTGATGGCC GCCGTTGACG AGCACATCCC CACCCCCCAG 550 
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CGTGACTTGG AGCAGCCTTT CTTGATGCCT 
CTCTGGCCGT GGTACCGTGG TGACTGGTAA 
AGAAGGGTGA GGAGATTGAG ATTGTTGGCA 
GTGACTGTTA CTGGTATTGA GATGTTCAAG 
5 GGCTGGTGAC AACGCCGGTA TCTTGTTGCG 
TGTCTCGTGG TATGGTTTTG GCCAAGCCCG 
AAGGTTTTGG CTTCGCTTTA CATCTTGACC 
GACCGGCTTC GGCTCCAACT ACAAGCCCCA 
ACGTCACTGG TACCCTCACC TTC 
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GTTGAGGGTG TTTTCTCTAT 600 

GGTTGCCCGT GGTGTCCTCA 650 

ACTTTGACAA GCCCTACAAG 70 0 

AAGGAGTTGG ACCAGGCCAT 750 

TGGTGTCAAG CGTGACGAGG 800 

GCACTGTTGT CTCGCACAAG 850 

CAGGAGGAGG GTGGCCGTAA 900 

GTTGTTCTTG CGCACTACCG 950 

973 



2) INFORMATION FOR SEQ ID NO: 1960 

L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 
IS (B) STRAIN: ATCC 28870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1960 

AAGACTACCT TGACTGCTGC AATCACCAAG GTCTTAGCTG ATCAAGGTGG 50 

10 TGCTGATTTC TTAGATTATG CATCTATTGA CAAGGCTCCT GAAGAAAGAG 100 

CAAGAGGTAT TACTATCTCT ACTGCTCACG TTGAGTATGA AACCCCAAAC 150 

AGACATTATT CTCATGTCGA TTGTCCTGGC CATCAAGATT ATATTAAGAA 2 00 

TATGATTACT GGTGCTGCAC AAATGGATGG TGCTATTATT GTTGTTGCTG 250 

CTACTGATGG TCAAATGCCA CT^AACTAAGG AACATTTATT ATTAGCAAGA 300 

IS CAAGTTGGTG TTCAACATTT AGTTGTCTTT GTTAATAAAT GTGACACCAT 350 

TGATGACCCA GAAATGTTGG AATTAGTTGA AATGGAAATG AGAGAACTAT 400 

TGTCTGAATA TGGTTTTGAT GGTGATAACA CTCCAGTTAT TATGGGTTCT 450 

GCATTGATGG CTTTAGAAGA CAAGAGACCT GAAGTTGGTA AGGAATCTAT 500 

TTTAAAGTTA ATGGAAGCYG TTGACACATG GATTCCAACC CCAGAGAGAG 550 

10 ATTTAGAAAA ACCATTTTTG TTACCTATTG ATGAAGTTTT CTCAATCTCT 600 

GGTAGAGGTA CTGTCGTTTC TGGTACTGTC GAAAGAGGTA CTTTGAAGAA 650 

GGGTGAAGAA GTTGAAATTG TTGGTGGTAA GGATGGTTCT ATTAAAACTA 700 

CTGTCACAGG TATTGAAATG TATCACAAGG AATTAGACCA AGCGCAAGCA 750 

GGTGATACTC CAGGTATTTT ATTAAGAGGT GTCAAGAGAG ACCAAATCAA 800 

15 GAGAGGTCAA ATTTTAGCAA AGCCAGATTC CGTTAAGGCA TACAAGAAGT 850 

TCTTGGCTTC CCTTTATATC TTAACCAAGG AAGAAGGTGG TAGACATACA 900 

CCATTCTCTG AAAACTACAG ACCACAAATG TACATCAGAA CTACCAATGT 950 

TAACGTTACT TTGAAGTTCC CAGACACTGA AGAAG 985 



2) INFORMATION FOR SEQ ID NO: 1961 
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10 



30 



i5 



(i) SEQUENCE CHAiy^CTERISTICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1961 
GCTCAAGGCA GATGGCATTC CC 22 



15 2) INFORMATION FOR SEQ ID NO: 1962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1962 

GGACAAGGCG GTTGCGTTTG AT 22 



2 ) INFORMATION FOR SEQ ID NO: 1963 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
35 (B) TYPE: Nucleic acid 

(C) STIUVNDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1963 
CATTCCCGTC TCGCTCGACA GT 22 

2) INFORMATION FOR SEQ ID NO: 1964 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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5 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1964 
ATCTGCCTGC CCGTCTTGC 19 



10 2) INFORMATION FOR SEQ ID NO: 1965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pGSOS 

(C) ACCESSION NUMBER: M36657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1965 



ATGAATAAAT CGCTCATCAT TTTCGGCATC GTCAACATAA CCTCGGACAG 50 

TTTCTCCGAT GGAGGCCGGT ATCTGGCGCC AGACGCAGCC ATTGCGCAGG 100 

CGCGTAAGCT GATGGCCGAG GGGGCAGATG TGATCGACCT CGGTCCGGCA 150 

TCCAGCAATC CCGACGCCGC GCCTGTTTCG TCCGACACAG AAATCGCGCG 200 

30 TATCGCGCCG GTGCTGGACG CGCTCAAGGC AGATGGCATT CCCGTCTCGC 250 

TCGACAGTTA TCAACCCGCG ACGC7\AGCCT ATGCCTTGTC GCGTGGTGTG 300 

GCCTATCTCA ATGATATTCG CGGTTTTCCA GACGCTGCGT TCTATCCGCA 350 

ATTGGCGAAA TCATCTGCCA AACTCGTCGT TATGCATTCG GTGCAAGACG 4 00 

GGCAGGCAGA TCGGCGCGAG GCACCCGCTG GCGACATCAT GGATCACATT 450 

35 GCGGCGTTCT TTGACGCGCG CATCGCGGCG CTGACGGGTG CCGGTATCAA 500 

ACGCAACCGC CTTGTCCTTG ATCCCGGCAT GGGGTTTTTT CTGGGGGCTG 550 

CTCCCGAAAC CTCGCTCTCG GTGCTGGCGC GGTTCGATGA ATTGCGGCTG 600 

CGCTTCGATT TGCCGGTGCT TCTGTCTGTT TCGCGCAAAT CCTTTCTGCG 650 

CGCGCTCACA GGCCGTGGTC CGGGGGATGT CGGGGCCGCG ACACTCGCTG 700 

10 CAGAGCTTGC CGCCGCCGCA GGTGGAGCTG ACTTCATCCG CACACACGAG 750 

CCGCGCCCCT TGCGCGACGG GCTGGCGGTA TTGGCGGCGC TGAAAGAAAC 800 

CGCAAGAATT CGTTAA 816 



i5 

2) INFORMATION FOR SEQ ID NO: 1966 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1966 
5 CATGCCAGTC TTGCCAACG 19 



10 



2) INFOW^TION FOR SEQ ID NO: 1967 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1967 

20 

CAGCAATAAG TAATCCAGCG ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1968 

GGAGAGATTT CACCGCATAG 20 



10 



50 



2) INFORMATION FOR SEQ ID NO: 1969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1969 

AGCCAACCAT CATGCTATTC CA 22 
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2) INFORMATION FOR SEQ ID NO: 1970 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



5 



(ii) 



MOLECULE TYPE: Genomic DNA 



LO 



(vi) 



ORIGINAL SOURCE: 
(A) ORGANISM: Transposon TnlO 
(C) ACCESSION NUMBER: J01830 



L5 



(xi) 



SEQUENCE DESCRIPTION: SEQ 



ID NO: 1970 



ATGAATAGTT CGACAAAGAT CGCATTGGTA ATTACGTTAC TCGATGCCAT 
GGGGATTGGC CTTATCATGC CAGTCTTGCC AACGTTATTA CGTGAATTTA 
TTGCTTCGGA AGATATCGCT AACCACTTTG GCGTATTGCT TGCACTTTAT 

2 0 GCGTTAATGC AGGTTATCTT TGCTCCTTGG CTTGGAAAAA TGTCTGACCG 
ATTTGGTCGG CGCCCAGTGC TGTTGTTGTC ATTAATAGGC GCATCGCTGG 
ATTACTTATT GCTGGCTTTT TCAAGTGCGC TTTGGATGCT GTATTTAGGC 
CGTTTGCTTT CAGGGATCAC AGGAGCTACT GGGGCTGTCG CGGCATCGGT 
CATTGCCGAT ACCACCTCAG CTTCTCAACG CGTGAAGTGG TTCGGTTGGT 

25 TAGGGGCAAG TTTTGGGCTT GGTTTAATAG CGGGGCCTAT TATTGGTGGT 
TTTGCAGGAG AGATTTCACC GCATAGTCCC TTTTTTATCG CTGCGTTGCT 
AAATATTGTC ACTTTCCTTG TGGTTATGTT TTGGTTCCGT GAAACCAAAA 
ATACACGTGA TAATACAGAT ACCGAAGTAG GGGTTGAGAC GCAATCGAAT 
TCGGTATACA TCACTTTATT TAAAACGATG CCCATTTTGT TGATTATTTA 

JO TTTTTCAGCG CAATTGATAG GCCAAATTCC CGCAACGGTG TGGGTGCTAT 
TTACCGAATIA TCGTTTTGGA TGGAATAGCA TGATGGTTGG CTTTTCATTA 
GCGGGTCTTG GTCTTTTACA CTCAGTATTC CAAGCCTTTG TGGCAGGAAG 
AATAGCCACT AAATGGGGCG AAAAAACGGC AGTACTGCTC GAATTTATTG 
CAGATAGTAG TGCATTTGCC TTTTTAGCGT TTATATCTGA AGGTTGGTTA 

J 5 GATTTCCCTG TTTTAATTTT ATTGGCTGGT GGTGGGATCG CTTTACCTGC 
ATTACAGGGA GTGATGTCTA TCCAAACAAA GAGTCATGAG CAAGGTGCTT 
TACAGGGATT ATTGGTGAGC CTTACCAATG CAACCGGTGT TATTGGCCCA 
TTACTGTTTA CTGTTATTTA TAATCATTCA CTACCAATTT GGGATGGCTG 
GATTTGGATT ATTGGTTTAG CGTTTTACTG TATTATTATC CTGCTATCGA 

10 TGACCTTCAT GTTAACCCCT CAAGCTCAGG GGAGTAAACA GGAGACAAGT 
GCTTAG 



15 2) INFORMATION FOR SEQ ID NO: 1971 



50 
100 
150 
200 
250 
300 
350 
400 
450 
. 500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1206 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1971 
CYGACTGYGC CATCCTYATC A 21 

5 

2) INFORMATION FOR SEQ ID NO: 1972 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1972 
MGICAGCTCA TYITTGCWKS C 21 

20 

2) INFORMATION FOR SEQ ID NO: 1973 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1973 
35 RACACCRGIY TTGGWITCCT T 21 



iO 



2) INFORMATION FOR SEQ ID NO: 1974 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1974 

SO 

ACAAGGGITG GRMSAAGGAG AC 22 
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2) INFORMATION FOR SEQ ID NO: 1975 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1975 
TGRCCRGGGT GGTTRAGGAC G 



2) INFORMATION FOR SEQ ID NO: 1976 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1976 
GATGGAYTCY GTYAAITGGG A 



2) INFORMATION FOR SEQ ID NO: 1977 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1977 
GATGGAYTCY GTYT^TGGG A 



2) INFORMATION FOR SEQ ID NO: 1978 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1978 
CATCITGYAA TGGYAATCTY AAT 



2) INFORMATION FOR SEQ ID NO: 1979 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1979 
CATCYTGYAA TGGYAASCTY AAT 



2) INFORMATION FOR SEQ ID NO: 1980 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1980 
TCRATGGCIT CIAIE^GRGT YT 



2) INFORMATION FOR SEQ ID NO: 1981 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1981 

TGGACACCIS CAAGIGGKCY G 21 



2) INFORMATION FOR SEQ ID NO: 1982 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1982 
TGGACACYIS CAAGIGGKCY G 21 

20 

2) INFORMATION FOR SEQ ID NO: 1983 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1983 
35 CYGAYTGCGC YATICTCATC A 21 



40 



2) INFORMATION FOR SEQ ID NO: 1984 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1984 

50 

CYGAYTGYGC YATYCTSATC A 21 
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2) INFORMATION FOR SEQ ID NO: 1985 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



5 



(ii) 



MOLECULE TYPE: Genomic DNA 



LO 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Ml-106 

(C) ACCESSION NUMBER: U81804 



L5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1985 

ATGGGTAAGG ACAAGCTGCA CGTCAACGTC GTTGTTATCG GTCACGTCGA 
CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 

20 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 
AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 
AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCTA 
AGTACCAGGT TACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 
AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 

25 CACCGGTATC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCC 
GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 
GCTTGCAACA AGATGGACAC CTGCAAGTGG TCTGAGGACC GATTCAACGA 
AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGT TACAACCCCA 
AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 

30 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 
GTCCGGTGTT TCCAAGGGTA AGACCCTTCT CGAGGCCATC GACGCCAGTA 
GGCCCCCTAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 
GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 
CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 

35 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 
CTCCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 
CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 
AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 
ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 

10 TTGCAAGTTC TCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 
TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 
AAGCTTGTTT CCCAGAAGCC TCTCTGTGTT GAGACCTACG CCGACTACCC 
CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 
GTGTTATTAA GAGTGTGGAG AAGTCCGATG GGAAGAGCGG CAAGGTTACC 

15 AAGGCCGCCG AGAAGGCTGC TAAGAAG/^G TAA 



2) INFORMATION FOR SEQ ID NO: 1986 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1383 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



5 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: B3501 

(C) ACCESSION NUMBER: U81803 



LO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1986 

ATGGGTAAGG ACAAGCTGCA CGTCAACGTC GTTGTTATCG GTCACGTCGA 
CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 

L5 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 
AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 
AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCCA 
GGTACCAGGT CACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 
AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 

20 CACCGGTATC GGTGAGTTCG AGGCCGGTAT CTCCAAGGAC GGTCAGACCC 
GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 
GCTTGCAACA AGATGGACAC CTGCAAGTGG TCCGAGGACC GATTCAACGA 
AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGC TACAACCCCA 
AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 

25 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 
GTCTGGTGTT TCCAGGGGTA AGACCCTTCT CGAGGCCATC AGCGCCAGTA 
GGCCCCATAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 
GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 
CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 

30 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 
CTTCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 
CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 
AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTT7U\CCA CCCTGGTCAG 
ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 

35 CTGCAAGTTT GCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 
TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 
AAGCTTGTTG CCCAGAAGCC CCTCTGTGTT GAGACCTACG CCGACTACCC 
CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 
GTGTTATCAA GAGCGTGGAC AAGACCGAGA AGGGTGGCAA GGTCACCAAG 

10 GCTGCTGAGA AGGCTGCCAA GAAGAAGTT^ 



2) INFORMATION FOR SEQ ID NO: 1987 



50 
100 
150 
200 
. 250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1380 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) 



MOLECULE TYPE: Genomic DNA 



1030 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 PCT/CAOO/01150 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: X01638 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1987 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

10 AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

15 GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

20 GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 7 00 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TCGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTCGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAT^TGCATC ACGAACAATT GGAACAAGGT 900 

25 GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

30 AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCT^G 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 

35 



2) INFORMATION FOR SEQ ID NO: 1988 



4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
50 (C) ACCESSION NUMBER: M10992 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1988 



1031 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 PCT/CAOO/01150 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

5 AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 4 50 

10 GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 7 00 

15 AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 7 50 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

2 0 AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC ' 1200 

25 AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



30 



2) INFORMATION FOR SEQ ID NO: 1989 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 

(B) STRAIN: ATCC 10895 

45 (C) ACCESSION NUMBER: X73978 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1989 

ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

50 CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 200 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 

1032 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 PCT/CAOO/01150 

AGTACCACGT CACTGTCATT GACGCCCCAG GCCACAGAGA CTTCATCAAG 300 

AACATGATTA CCGGTACTTC TCAAGCTGAC TGTGCCATCT TGATCATTGC 350 

TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 400 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 450 

5 GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 

AGACTGTTCC ATTCGTTCCA ATCTCCGGCT GGAACGGTGA CAACATGATT 600 

GAGGCCACCA CCAACGCCCC ATGGTACAAG GGCTGGGAGA AGGAGACCAA 650 

GGCTGGTGCC GTCAAGGGTA AGACCTTGTT GGAGGCCATT GACGCCATTG 700 

10 AGCCACCTGT CAGACCAACT GACAAGGCAT TGAGATTGCC ATTGCAGGAT 750 

GTCTACAAGA TCGGTGGTAT TGGTACGGTT CCAGTCGGCA GAGTCGAGAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC CTTCGCCCCA TCCGGTGTCA 850 

CCACTGAAGT CAAGTCCGTC GAGATGCACC ACGAGCAATT GGAGGAGGGT 900 

GTCCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTCT CCGTCAAGGA 950 

15 GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 

CTGCTGAGTC CTTCAACGCT ACCGTCATTG TCTTGAACCA CCCAGGTCAA 1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG CCGGTGACGC TGCCATGGTC 1200 

20 AAGTTTGTCC CATCCAAGCC AATGTGTGTT GAGGCTTTCA CCGACTACCC 1250 

ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 1300 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 

25 

2) INFORMATION FOR SEQ ID NO: 1990 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 
(C) ACCESSION NUMBER: A29820 



iO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1990 



ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

J 5 TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 200 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 

AGTACCACGT CACTGTCATT GACCCCCCAG GCCACAGAGA CTTCATCAAG 3 00 

AACATGATTA CCGGTACTTC TCAAGCTGAC TGTGCCATCT TGATCATTGC 350 

50 TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 400 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 450 

GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 

1033 



SUBSTfTUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



AGACTGTTCC ATTCGTTCCA ATCTCCGGCT 
GAGGCCACCA CCAACGCCCC ATGGTACAAG 
GGCTGGTGCC GTCAAGGGTA AGACCTTGTT 
AGCCACCTGT CAGACCAACT GACAAGGCAT 
5 GTCTACTU^GA TCGGTGGTAT TGGTACGGTT 
CGGTGTCATC AAGCCAGGTA TGGTTGTTAC 
CCACTGAAGT CAAGTCCGTC GAGATGCACC 
GTCCCAGGTG ACAACGTTGG TTTCAACGTC 
GATCAGAAGA GGTAACGTTT GCGGTGACTC 

10 CTGCTGAGTC CTTCAACGCT ACCGTCATTG 
ATCTCTGCCG GTTACTCTCC AGTCTTGGAC 
TTGTAAGTTC GACGAGTTGT TGGAGAAGAA 
AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG 
AAGTTTGTCC CATCCAAGCC AATGTGTGTT 

L5 ACCATTGGGT AGATTCGCTG TCAGAGACAT 
GTGTCATCAA GTCTGTTGTC AAGTCCGACA 
GCCGCCCAAA AGGCTGGTAA GAAATAG 



GGAACGGTGA CAACATGATT 600 

GGCTGGGAGA AGGAGACCAA 650 

GGAGGCCATT GACGCCATTG 700 

TGAGATTGCC ATTGCAGGAT 750 

CCAGTCGGCA GAGTCGAGAC 800 

CTTCGCCCCA TCCGGTGTCA 850 

ACGAGCAATT GGAGGAGGGT 900 

AAGAACGTCT CCGTCAAGGA 950 

CAAGAACGAC CCACCAAAGG 1000 

TCTTGAACCA CCCAGGTCAA 1050 

TGTCACACTG CCCACATTGC 1100 

CGACAGAAGA ACCGGTAAGA 1150 

CCGGTGACGC TGCCATGGTC 1200 

GAGGCTTTCA CCGACTACCC 1250 

GAGACAGACC GTTGCTGTCG 13 00 

AGGCTGGTAA GGTCACCAAG 1350 

1377 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1991 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 164 6 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus oryzae 

(B) STRAIN: KBN616 

(C) ACCESSION NUMBER: AB007770 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1991 



TGGGGTAAGT TTATCAACCC GTCGAGTTGT GTTGCATCTC AGATCATGGC 50 

TGACAAGTAC TTTCCTCCTT ACAGTAAGGA AGACAAGCAG CACATCAACA 100 

10 TCGTCGTTAT CGGCCACGTC GATTCCGGCA AGTCCACCAC CACTGGTCAC 150 

TTGATCTACA AGTGTGGTGG TATCGACCAG CGTACCATCG AGAAGTTCGA 200 

GAAGGAAGCC GCTGAGCTCG GTAAGGGTTC CTTCAAGTAC GCCTGGGTTC 250 

TTGACAAGCT CAAGTCCGAG CGTGAGCGTG GTATCACCAT CGATATCGCC 300 

CTCTGGT^GT TCCAGACCTC CT^^GTATGAG GTCACCGTCA TTGGTAAGCA 350 

45 TTTGAGTTCC AACCTACGTT GCCCAACATT TACAGTCATC TAACAAAGTT 4 00 

CAATAGATGC CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT 450 

ACTTCCCAGG CTGACTGCGC TATCCTCATC ATTGCCTCCG GTACTGGTGA 500 

ATTCGAGGCT GGTATCTCCA AGGATGGTCA GACCCGTGAG CACGCTCTGC 550 

TCGCTTTCAC CCTCGGTGTC CGTCAGCTCA TCGTTGCCCT CAACAAGATG 600 

50 GACACCTGCA AGTGGTCTCA GGATCGTTAC AACGAAATCG TTAAGGAGAC 650 

TTCCAACTTC ATCAAGAAGG TCGGATACAA CCCCAAGAGC GTTCCTTTCG 700 

TCCCCATCTC CGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCACCAAC 750 

TGCCCCTGGT ACAAGGGCTG GGAGAAGGAG ACCAAGGCTG GCAAGTCCAC 800 

1034 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 PCT/CAOO/01 150 

CGGTAAGACC CTTCTCGAGG CCATCGATGC CATCGAGCCC CCCGTCCGTC 850 

CCACCGACAA GCCTCTCCGT CTTCCCCTCC AGGATGTCTA CAAGATCTCT 900 

GGTATCGGTA CTGTGCCCGT CGGTCGTGTC GAGACTGGTG TCATCAAGCC 950 

TGGTATGGTC GTTACTTTCG CTCCTGCCAA CGTGACCACT GAAGTCAAGT 1000 

5 CCGTTGAAAT GCACCACCAG CAGCTCCAGG CCGGTAACCC CGGTGACAAC 1050 

GTTGGTTTCA ACGTCAAGAA CGTCTCCGTC AAGGAAGTCC GCCGTGGTAA 1100 

CGTTGCCGGT GACTCCAAGA ACGACCCCCC TGCTGGCTGC GATTCCTTCA 1150 

ACGCCCAGGT CATCGTCCTT AACCACCCCG GTCAGGTCGG CAACGGTTAC 1200 

GCTCCCGTCC TGGACTGCCA CACCGCTCAC ATTGCTTGCA AGTTCGCTGA 1250 

10 GCTCCTTGAG AAGATTGACC GCCGTACCGG TAAATCTGTT GAGGACAAGC 1300 

CCAAGTTCAT CAAGTCTGGT GATGCTGCCA TCGTCAAGAT GATTCCCTCC 1350 

AAGCCCATGT GTGTGGAGTC TTTCACTGAC TTCCCCCCTC TTGGTCGTTT 14 00 

CGCTGTCCGT GACGTAAGTT TTTCCCTCTT GACTATCTTC ACAATTTTTC 14 50 

ACATATTTTC ACGCCTCGTC CCACTCTTTT TCCTCCCTTC CTCTTTGGTT 1500 

15 CCCCTTTTTG CCTGCAAGTT CTCTATAGCT AACATGATGT CTAGATGCGT 1550 

CAAACTGTTG CCGTCGGAGT TATCAAGTCG GTTGAGAAGA ACACTGGCGG 1600 

TTCTGGCAAG GTCACCAAGG CCGCCCAGAA GGCTGGCAAG AAATAA 164 6 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1380 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: R106 

(C) ACCESSION NUMBER: U19723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1992 



ATGGGTAAGG AAAAGTCCCA CATCAACGTC GTCGTTATCG GCCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

40 TCGACAAGCG TACCATCGAG AAGTTCGAGA AGGAAGCCGC CGAACTCGGC 150 

AAGGGTTCCT TCAAGTACGC CTGGGTCCTC GACAAGCTGA AGTCTGAGCG 200 

TGAGCGTGGT ATCACTATCG ATATCGCTCT GTGGAAGTTC GAGACCCCCA 250 

AGTACATGGT CACCGTCATC GATGCCCCCG GTCACCGTGA TTTCATCAAG 300 

AACATGATCA CTGGTACCTC CCAGGCTGAC TGCGCCATTC TCATCATTGC 350 

45 CGCCGGTACT GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGCCAGACTC 400 

GTGAGCACGC CCTTCTCGCC TACACCCTTG GTGTCAAGCA GCTCATCGTC 4 50 

GCTATCAACA AGATGGACAC CACCAAGTGG TCTGAGGCCC GTTACCAGGA 500 

GATCATCAAG GAGACCTCCG GTTTCATCAA GAAGGTCGGC TACAACCCCA 550 

AGCACGTTCC CTTTGTCCCC ATCTCGGGTT TCAACGGTGA CAACATGATT 600 

50 GAGGTTTCTT CCAACTGCCC CTGGTACAAG GGTTGGGAGA AGGAGACCAA 650 

GGCCAAGGCC ACTGGTAAGA CTCTCCTCGA GGCCATTGAC GCCATCGACC 700 

CTCCTTCGCG CCCCACCGAC AAGCCCCTCC GTCTTCCCCT CCAGGATGTC 750 

TACAAGATCG GTGGTATTGG CACGGTGCCC GTCGGCCGTG TCGAGACCGG 800 

1035 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

TACCATCAAG GGTGGTATGG TCGTCACCTT 
CTGAGGTCAA GTCCGTCGAG ATGCACCACG 
CCCGGTGACA ACGTCGGCTT CAACGTCAAG 
CCGTCGTGGT AACGTTGCCG GTGACTCCAA 
5 GCGACTCCTT CAACGCCCAG GTCATCGTCC 
GGTGCTGGTT ACGCACCCGT CCTCGATTGC 
CAAGTTCTCC GAGCTTGTTG AGAAGATTGA 
TTGAGGCCGC CCCCAAGTTC ATCAAGTCTG 
ATGGTTCCCT CCAAGCCTAT GTGTGTTGAG 
10 TCTCGGTCGT TTCGCCGTCC GTGACATGAG 
TCATCAAGTC CGTCGCCAAG TCCGACAAGC 
AAGGCCGCTG TCAAGGCTGG CAAGAAGTAA 



PCT/CAOO/01150 

CGCCCCCGCT GGTGTCACCA 850 

AGCAGCTCTC CGAGGGTCTC 900 

AACGTCTCCG TCAAGGAGAT 950 

GAACGACCCC CCCAAGGGTT 1000 

TCAACCACCC CGGTCAGGTC 1050 

CACACTGCCC ACATCGCCTG 1100 

CCGCCGTACC GGCAAGTCCG 1150 

GTGACGCCGC CATCGTCAAG 1200 

GCCTTCACTG ACTACCCTCC 1250 

ACAGACCGTC GCTGTCGGTG 1300 

AGGGTGCCGG TAAGGTTACC 1350 

1380 



L5 

2) INFORMATION FOR SEQ ID NO: 1993 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1383 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: 186AS 

(C) ACCESSION NUMBER: U14100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1993 



ATGGGTAAGG AAGACAAGAC TCACATTAAC CTCGTCGTCA TCGGCCACGT 50 

CGATTCGGGC AAATCTACCA CCACTGGTCA TTTGATCTAC AAATGCGGTG 100 

35 GTATTGACAG CCGTACCATT GAGAAGTTCG AAAAGGAAGC CGAAGAGTTG 150 

GGCAAGAAAT CCTTCAAATA TGCGTGGGTC CTTGACAAAC TGAAGTCTGA 200 

GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAA TTCGAGACTC 250 

CGAAGTACAG TGTCACTGTC ATTGATGCTC CCGGCCATCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GACTGCGCTA TCCTCATCAT 350 

40 TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CTCGTGAGCA CGCTCTGCTT GCTTTCACCC TTGGTGTGAG GCAACTCATC 450 

GTTGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTTCAA 500 

CGAAATCATC AAGGAGGTTT CCAACTTCAT CAAGAAGGTC GGATATAACC 550 

CCAAGGCTGT TCCCTTCGTG CCAATCTCTG GTTTCGAGGG TGACAACATG 600 

45 ATTGAACCCT CCCCCAACTG CACATGGTAC AAGGGCTGGA ACAAGGAGAC 650 

TGCCTCTGGC AAGTCTTCTG GTAAAACCCT TCTCGATGCC ATTGACGCCA 700 

TTGAACCCCC AACCCGTCCT ACCGATAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTGTACA AAATCTCTGG TATTGGCACT GTTCCCGTCG GACGTGTTGA 800 

GACTGGTGTC ATCAAGCCCG GTATGGTCGT GACTTTCGCT CCCTCCAACG 850 

50 TCACCACTGA AGTCAAGTCC GTCG7WVTGC ACCACCAACA ACTCCAGGCT 900 

GGTTACCCTG GCGACAACGT CGGCTTCAAC GTCAAGAACG TTTCAGTCAA 950 

GGAAGTCCGC CGTGGCAACG TTGCTGGCGA CTCCAAAAAT GATCCCCCTA 1000 

AGGGCTGCGA ATCCTTCAAT GCCCAGGTCA TCGTCCTTAA CCACCCCGGC 1050 

1036 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

CAGGTTGGCG CTGGTTATGC 
TGCTTGCAAG TTCTCTGAGC 
AGTCTGTTGA G/y^CAACCCC 
GTCAAGATGG TTCCCTCCAA 
5 TCCCCCTCTT GGACGTTTCG 
TCGGTGTCAT CAAGTCCGTC 
ACCAAGGCCG CGCAGAAGGC 



CCCAGTCCTC GACTGCCACA 
TTATTGAGAA GATCGACCGC 
AAGTTCATCA AGTCTGGTGA 
GCCCATGTGC GTGGAGCCCT 
CTGTCCGTGA CATGAGACAA 
ATCAAGTCTG ACAAGACTGC 
CACCAAGAAA TAA 



PCT/CAOO/01150 

CTGCCCACAT 1100 

CGTACCGGAA 1150 

TGCTGCTATC 1200 

TCACTGACTA 1250 

ACCGTCGCTG 1300 

TGGCAAGGTC 1350 
1383 



LO 



2) INFORMATION FOR SEQ ID NO: 1994 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1383 bases 
L5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neurospora crassa 
(C) ACCESSION NUMBER: D45837 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1994 

ATGGGC/^GG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCTACCA CTACCGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC CGCTGAGCTC 150 

30 GGTAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGTCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TCCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

35 CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA GCAGCTCATT 450 

GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGA CTCGTTTCGA 500 

GGAGATCATC J\AGGAGACCA AGAACTTCAT CAAGAAGGTT GGCTACAACC 550 

CCGCTGGTGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGCCCT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 650 

10 CAAGGCCGGC AAGGCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 700 

TTGAGCCCCC CAAGCGTCCT ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGTGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

15 GGTGTCCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA 950 

GGATATCCGC CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

CTGGCGCCGC CTCTTTCACC GCCCAGGTCA TCGTTCTCAA CCACCCCGGT 1050 

CAGGTCGGTG CCCGCTACGC CCCCGTCCTC GACTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGACCGC CGTACTGGTA 1150 

50 AGGCTGTTGA GGCCTCCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGCCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCCG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGTCCA CCGCTGCCGC TGGCAAGGTC 1350 

1037 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

ACCAAGTCCG CTGCCAAGGC CGCCAAGAAG TAA 



PCT/CAOO/01150 

1383 



5 2) INFORUdATION FOR SEQ ID NO: 1995 



(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 1383 bases 
(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora anserina 
(C) ACCESSION NUMBER: X747 99 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1995 

20 

ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

25 GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATTGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGi\A GCAGCTCATC 450 

30 GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

35 TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA . GCTCGCTGAG 900 

GGTGTTCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

iO GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC CTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCTCTGAGC TCCTGCAGAA GATCGACCGC CGTACTGGTA 1150 

AGGCCGTTGA GGAGAGCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

45 GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTT 1350 

ACCAAGTCCG CTGCCAAGGC TGGCAAGAAA TAA 1383 

50 

2) INFORMATION FOR SEQ ID NO: 1996 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora curvicolla 

(B) STRAIN: VLV 

(C) ACCESSION NUMBER: X96614 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1996 

ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TTGATATCGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 450 

GTCGCCATCA ACAAGATGGA CACCACCAAA TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGT^AGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCTACTG GCAAGACCCT CCTCGAGGCC ATCGACTCCA 700 

TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTTCAG 750 

GACGTTTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCC CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTCTGAG 900 

GGTGTCCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTC 1000 

TTGGCGCCGC TTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCGCTGAGC TCCTGCAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GGAGAGCCCT AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CTGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC TGGTGGCAAG AAATAA 1386 



2) INFORMATION FOR SEQ ID NO: 1997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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LO 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sordaria macrospora 

(B) STRAIN: OOO 

(C) ACCESSION NUMBER: X96615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1997 



ATGGGTAAGG AAGACAAGGC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCCACCA CTACCGGTCA CCTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC CGCTGAGCTC 150 

GGCAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

L5 GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TTCTCATCAT 350 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 4 00 

CTCGTGAGCA. CGCTCTTCTC GCCTACACCC TCGGTGTCAA GCAGCTCATC 4 50 

10 GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGG CTCGTTTCGA 500 

GGAGATCATC AAGGAGACCA AGAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCGCCACCGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGGCCT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 650 

CAAGGCCGGC AAGTCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 700 

15 TTGAGCAGCC CAAGCGCCCG ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 7 50 

GATGTCTACA AGATCGGCGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

GGTGTTCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA 950 

30 GGATATCCGT CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

TCGGCGCTGC CTCTTTCACC GCCCAGGTCA TCGTCCTTAA CCACCCCGGT 1050 

CAGGTCGGTG CCGGCTACGC TCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GACTTCTCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

15 GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTCGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGACCC AGGCTGTCGC TGGCAAGGTC 1350 

ACCAAGTCTG CTGCCAAGGC TGCCAAGAAG TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1998 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichoderma reesei 
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(B) STRAIN: QM9414 

(C) ACCESSION NUMBER: Z2^3012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1998 

5 

ATGGGTAAGG AGGACAAGAC TCACATCAAC GTGGTCGTCA TCGGCCACGT 50 

CGACTCCGGC AAGTCTACCA CCACTGGTCA CTTGATCTAC CAGTGCGGTG 100 

GTATCGACAA GCGTACCATT GAGAAGTTCG AGAAGGAAGC CGCCGAACTC 150 

GGCAAGGGTT CCTTCAAGTA CGCGTGGGTT CTTGACAAGC TCAAGGCCGA 200 

10 GCGTGAGCGT GGTATCACCA TCGACATTGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA TGTCACCGTC ATTGACGCTC CCGGCCACCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC TTCCCAGGCC GACTGCGCTA TCCTCATCAT 350 

CGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTGCTC GCCTACACCC TGGGTGTCAA GCAGCTCATC 450 

15 GTCGCCATCA ACAAGATGGA CACTGCCAAC TGGGCCGAGG CTCGTTACCA 500 

GGAAATCATC AAGGAGACTT CCAACTTCAT CAAGAAGGTC GGCTTCAACC 550 

CCAAGGCCGT TGCTTTCGTC CCCATCTCCG GCTTCAACGG TGACAACATG 600 

CTCACCCCCT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGAC 650 

CAAGGCTGGC AAGTTCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

20 TCGAGCCCCC CAAGCGTCCC ACGGACAAGC CCCTGCGTCT TCCCCTCCAG 750 

GACGTCTACA AGATCGGTGG TATCGGAACA GTTCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCCTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCGCTGAG 900 

GGCCAGCCTG GTGACAACGT TGGTTTCAAC GTGAAGAACG TTTCCGTCAA 950 

25 GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC TTCTTTCACC GCCCAGGTCA TCGTCATGAA CCACCCCGGC 1050 

CAGGTCGGTG CCGGCTACGC CCCCGTCCTC GACTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGACCGC CGTACCGGTA 1150 

AGGCTACCGA GTCTGCCCCC AAGTTCATCA AGTCTGGTGA CTCCGCCATC 1200 

30 GTCAAGATGA TCCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACCGACTA 1250 

CCCTCCCCTG GGTCGTTTCG CCGTCCGTGA CATGCGCCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GAGAAGTCCT CTGCCGCCGC CGCCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC CGCCAAGAAA TAA 1383 

35 

2) INFORMATION FOR SEQ ID NO: 1999 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1999 
CATGTCAAYA TTGGTACTAT TGGTCATGT 29 

50 



2) INFORMATION FOR SEQ ID NO: 2000 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2000 
CCACCYTCIC TCAMGTTGAA RCGTT 



2) INFORMATION FOR SEQ ID NO: 2001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2001 
ACYACITTRA CIGCYGCYAT YAC 



2) INFORMATION FOR SEQ ID NO: 2002 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2002 
CCIGARGARA GAGCIMGWGG T 



2) INFORMATION FOR SEQ ID NO: 2003 

(i) SEQUENCE CHAEU^CTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

1042 

SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 

( D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2003 
CATYTCRAIR TTGTCACCTG G 
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2) INFORMATION FOR SEQ ID NO: 2004 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: SC5314 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2004 

GCTGCCTTCG ACCGTTCTAA ACCTCATGTC AACATTGGTA CTATTGGTCA 50 

TGTTGATCAT GGTAAAACTA CATTGACTGC TGCTATCACC AAAGTTTTAG 100 

CCGAACAAGG TGGTGCCAAC TTCTTGGATT ATGGTTCTAT TGATAGAGCT 150 

CCAGAAGAAA GAGCTAGAGG TATCACTATT TCCACTGCCC ACGTTGAATA 200 

CGAAACCAAG AACAGACACT ATGCCCACGT TGATTGTCCA GGACACGCTG 250 

ATTATATCAA AAATATGATT ACTGGTGCCG CTCAAATGGA TGGTGCTATC 300 

ATTGTTGTTG CTGCCACTGA TGGTCAAATG CCTCAAACCA GAGAACATTT 350 

GTTATTGGCC AGACAAGTTG GTGTTCAAGA CTTGGTTGTG TTTGTCAACA 4 00 

AAGTCGATAC TATTGATGAC CCTGAAATGT TGGAATTAGT CGAAATGGAA 450 
ATGAGAGAAT TGTTATCCAC CTACGGTTTT GATGGTGACA ACACTCCAGT 500 
TATTATGGGA TCTGCTTTAA TGGCTTTGGA AGACAAGAAA CCAGAAATTG 550 
GTAAGGAAGC TATCTTGAAA TTGTTAGATG CTGTCGATGA ACACATTCCA 6 00 

ACTCCATCAA GAGACTTGGA ACAACCATTT TTGTTACCAG TTGAAGACGT 6 50 

GTTCTCCATC TCCGGTAGAG GAACTGTTGT CACTGGTAGA GTTGAAAGAG 700 
GTGTTTTGAA GAAGGGTGAA GAAATCGAAA TTGTTGGTGG TTTTGACAAA 750 
CCTTACAAGA CTACTGTTAC CGGTATTGAA ATGTTCAAAA AAGAATTAGA 8 00 

CTCTGCTATG GCTGGTGACA ACTGTGGTGT TTTGTTAAGA GGTGTTAAAA 850 
GAGATGAAAT CAAGAGAGGT ATGGTTTTGG CCAAACCAGG TACTGCTACT 900 
TCTCACAAGA AGTTCTTGGC TTCCTTGTAT ATTTTGACTT CCGAAGAAGG 9 50 

TGGTCGTTCC ACTCCATTTG GTGAAGGTTA CAAGCCTCAA TGCTTCTTCA 1000 

GAACTAACGA TGTCACTACC ACATTTTCAT TCCCAGAAGG AGAAGGTGTT 1050 

GATCATTCTC AAATGATCAT GCCAGGTGAC AACATTGAAA TGGTTGGTGA 1100 

ATTGATCAAA TCTTGTCCAT TAGT^GTCAA CCAACGTTTC AACTTGAGAG 1150 

AAGGTGGTAA AACTGTTGGT ACTGGTTTGA TTACCAGAAT CATCGAATAA 1200 

ACAGAATGTG CACTGTGAAT AATAAAAAGA AAAGAGGTAT ATATAGGTGA 12 50 

CTTTGTATTT TGTATTGAAC AATAAAATTC TGTAAATAGT AAGGGCCTCA 13 00 

GAAGTTTTGA TTTGATTTAT GCCATGTGGA CTTGTAGAGA TATCCTTCTC 13 50 

AAACTTCTTG 13 60 



2) INFORMATION FOR SEQ ID NO: 2005 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 
(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2005 

AAGCCGCATG TCAATATTGG TACTATTGGT CATGTTGACC ACGGTAAAAC 50 

GACGTTGACG GCTGCTATTA CTAAATGCCT TTCTGATCTT GGTCAAGCTA 100 

GTTTTATGGA TTATAGTCAA ATTGACAAGG CCCCCGAGGA AAAGGCACGT 150 

GGTATTACCA TTTCATCTGC CCATGTTGAA TACGAAACTG CTAATCGTCA 200 

CTATGCCCAT GTGGATTGTC CTGGTCACGC CGATTACATT AAGAATATGA 250 

TTACTGGTGC TGCTACAATG GATGGCGCTA TCATTGTTGT TTCTGCTACC 300 

GATGGTCAAA TGCCTCAAAC TCGTGAACAT TTGCTTCTGG CTCGTCAAGT 350 

CGGTGTAT^G CAAATTGTTG TATACATCAA TAAAGTCGAT ATGGTCGAGC 400 

CTGATATGAT CGAGCTTGTC GAAATGGAAA TGCGTGAGCT ACTCTCCGAA 4 50 

TACGGATTTG ATGGTGACAA TACTCCAATT GTTAGCGGCA GTGCTTTATG 500 

TGCCTTAGAG GGTCGTGAGC CTGAGATTGG TCTCAATAGT ATTACTAAAT 550 

TGATGGAAGC TGTTGATAGT TATATTACTC TTCCTGAAAG AAAAACGGAT 600 

GTCCCTTTCT TGATGGCCAT CGAGGACGTT TTTTCi^ATTT CAGGTCGCGG 650 

AACTGTAGTC ACTGGCCGTG TCGAGCGCGG TACTTTAAAG AAGGGTGCTG 700 

AAATCGAAAT CGTCGGTTAT GGTAGCCATT TAAAGACTAC CGTTACTGGA 750 

ATTGAAATGT TCAAAAAGCA GCTTGATGCC GCCGTTGCCG GTGACAATTG 800 

TGGCCTTTTA CTTCGTTCTA TCAAGCGAGA GCAATTAAAA CGTGGAATGA 850 

TTGTCGCTCA ACCAGGAACC GTTGCTCCTC ATCAGAAATT CAAGGCATCA 900 

TTCTATATTT TGACAAAAGA GGAAGGAGGT CGTCGTACCG GTTTCGTTGA 950 

CAAGTATCGT CCCCAACTGT ACAGTCGTAC TTCCGACGTT ACTGTCGAAC 1000 

TTACCCACCC TGATCCTAAC GACTCAGACA AAATGGTTAT GCCTGGAGAC 1050 

AATGTCGAGA TGATCTGTAC GCTTATTCAC CCCATTGTCA TCGAAAAAGG 1100 

ACAACGCTTC ACAGTTCGTG AGGGTGGAAG CACTGTAGGC ACAGCTTTGG 1150 

TTACTGAACT TTTGGATTAG TGCATTTATG AACTTATTGG CTTTAAAAAT 12 0 0 

TTTGCATGCT GAATACCAAT ATTATGTCCC TTCTCAGAAT TCTATAACTA 125 0 

CAGTGTCATT ATTGTAATAA GACTTTTGCA TCCATTGACA ATGGTATTTG 13 0 0 

ATACTTTTAT AGTTTCTACT ATTGTTAGCC AAAGTTATAA AA 1342 



2) INFORMATION FOR SEQ ID NO: 2006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2006 
TGGAGCCGGT GAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2007 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2007 
TGGAGCCAGT GAGCGTGG 18 



2) INFORMATION FOR SEQ ID NO: 2008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2008 
TCTGGAGCCG ATGAGCGTG 19 



2) INFORMATION FOR SEQ ID NO: 2009 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2009 
CTGGAGCCAG TAAGCGTGG 19 



2) INFORMATION FOR SEQ ID NO: 2010 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: KMK107 

(C) ACCESSION NUMBER: AF027199 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2010 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAT^C GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 3 00 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 4 50 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGACG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 6 00 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCAGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 2011 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: CLSiS L-491 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2011 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCTTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 3 00 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 3 50 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACCCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

1047 



SUBSTITUTE SHEET (RULE 26) 



wo 01/23604 



PCT/CAOO/01150 



TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GAGCCAGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ATGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



650 
700 
750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 2012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2012 
CCGCGGATTA TTAAACCGCC CTTCCGCGG-MR-HEG-ATGTCAGAGG GATAGATCCA 



2) INFORMATION FOR SEQ ID NO: 2013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorhata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2013 

AGCTTAAGAA CTCTTATCTG GATTACGCGA TGTCGGTCAT TGTTGGCCGT 50 

GCGCTGCCGG ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT 100 

TTACGCCATG AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT 150 

CAGCCCGTGT CGTGGGTGAC GTGATCGGTA AATATCACCC GCATGGTGAT 200 

ACTGCCGTCT ATGACACTAT CGTCCGTATG GCACAGCCAT TCTCACTGCG 250 

ATACATGCTG GTAGATGGTC AAGGTAACTT CGGTTCTGTC GATGGCGACT 300 

CCGCCGCAGC GATGCGTTAT ACGGAAATCC GTATGTCGAA AATCGCCCAT 350 

GAG 353 



2) INFORMATION FOR SEQ ID NO: 2014 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgxana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2014 

AGCTCCTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

AGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

TGAACGTACT AGGCAATGAC TGGAACAAAG CCTATAAAAA ATCTGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAT CCCCATGGTG ACTCGGCGGT 2 00 

CTATGACACG ATCGTCCGCA TGGCGCAGCC ATTCTCGCTG CGTTATATGC 250 

TGGTAGACGG TCAGGGTAAC TTCGGTTCTA TCGACGGCGA CTCTGCGGCG 300 

GCAATGCGTT ATACGGAAAT CCGTCTGGCG AAAATTGCCC ATGAACTGAT 350 

GGCCG 355 



2) INFORMATION FOR SEQ ID NO: 2015 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2015 
CCAAGAAGCT CAAAAACATC TG 22 



2) INFORMATION FOR SEQ ID NO: 2016 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2016 
TADCCTGTCC AWACAGCCAT 20 



2) INFORMATION FOR SEQ ID NO: 2017 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2017 
ACTTTGAATA AGGTCGGTCT AG 



2) INFORMATION FOR SEQ ID NO: 2018 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2018 
ACACTAAACA AGGTTGGTTT AG 



2) INFORMATION FOR SEQ ID NO: 2019 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2019 
ACACTAAACA AGGTCGGTCT AG 



2) INFORMATION FOR SEQ ID NO: 2020 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2020 
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22 



2) INFORMATION FOR SEQ ID NO: 2021 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2021 
GTAGCTCCAG ACGAAATGTT TG 22 



2) INFORMATION FOR SEQ ID NO: 2022 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2022 
GTAGCTCCAG ATGAAACGTT TG 22 



2) INFORMATION FOR SEQ ID NO: 2023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2023 
GTAACTCCAG ATGAAATGTT TG 22 



2) INFORMATION FOR SEQ ID NO: 2024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2024 
AGTGAAAAGA TGGCTGCTGC 



2) INFORMATION FOR SEQ ID NO: 2025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2025 
AGTGAGAAAA TGGCTGCTGC 



2) INFORMATION FOR SEQ ID NO: 2026 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2026 
TCCAAGCATG CATTATGCAA ACG 



2) INFORMATION FOR SEQ ID NO: 2027 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2027 
TCGGTCTAGA TAGAGCTAAA ACG 
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2) INFORMATION FOR SEQ ID NO: 2028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2028 
TATGCTCTTC AACAATCACG 



2) INFORMATION FOR SEQ ID NO: 2029 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2029 
AGCCGTTGAG ACTTTGAATA AG 



2) INFORMATION FOR SEQ ID NO: 2030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2030 
CTTAATGGTC TTGGTATCG 



2) INFORMATION FOR SEQ ID NO: 2031 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2031 
CGTGACTGGG GTTCTGCTAT GA 



2) INFORMATION FOR SEQ ID NO:2032 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2032 
CGTGACTGGG GATCATCAAT GA 



2) INFORMATION FOR SEQ ID NO: 2033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2033 
CGTGACTGGG GTTCTGCCAT GA 



2) INFORMATION FOR SEQ ID NO: 2034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2034 
ATCAAGAACA CTGGCTATGT AG 
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2) INFORMATION FOR SEQ ID NO: 2035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2035 
ATCAAGAACA CTGGCTACGT AG 



2) INFORMATION FOR SEQ ID NO: 2036 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2036 
ATCAAGAACA CTGGTTACGT AG 



2) INFORMATION FOR SEQ ID NO: 2037 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2037 
ATCAAATU^TA CTGGTTATGT AG 



2) INFORMATION FOR SEQ ID NO: 2038 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2038 

ATCAAGAATA CTGGCTACGT AG 

2) INFORMATION FOR SEQ ID NO: 2039 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2039 
ATCAAAAACA CTGGCTATGT AG 



2) INFORMATION FOR SEQ ID NO: 2040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2040 
TGTGACCCCA GACAAACCC 



2) INFORMATION FOR SEQ ID NO: 2041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2041 
GTTGAGCGGC AGCACTATCT 



2) INFORMATION FOR SEQ ID NO: 2042 
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(i) SEQUENCE CHAiy^CTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2042 
CACGGGGATT TCTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2043 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2043 
CACGGGGATT ACTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2044 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2044 
ACCGTAAGTC GGCCAAGTCA 20 



2) INFORMATION FOR SEQ ID NO: 2045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2045 
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GTTCTTTCTC CGTATCGTC 



2) INFORMATION FOR SEQ ID NO: 2046 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 6 
ACGGGGATTT TTCTATCTAT 



2) INFORMATION FOR SEQ ID NO: 2047 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

(C) ACCESSION NUMBER: Z49094 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2047 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTT^GCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTATGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACCTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 3 50 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAATTGACTT 400 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 6 00 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAAGTCATC AATCAAGTTG 850 

AAGAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 
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AATGTAGACC /SiAGT^GCTCA AAAACATCTG 
TCAATACGTC TCTTACCCTG ACGATGATTT 
TAGATGTTTC AAATGGTAAA GTCATCGCAC 
GCAAGTT^TG TTTCATTCGG TACCAACCAG 
CTGGGGATCA TCAATGAAAC CAATCACTGA 
ATGGAGTCTA TGACTCTACT GCTTCTATTG 
TATCCTGGCA CTGATACTCC ACTCTACAAC 
AAACATTACA ATCCAGTATG CTCTTCAACA 
TTGAGACTTT GAATAAGGTC GGTCTAGATA 
GGTCTTGGTA TCGACTATCC AAGCATGCAT 
TAACACAACT GAATCCAACA AAAAATATGG 
CTGCTGCCTA CGCTGGTTTT GCTAATGGTG 
TATATCAATA AAATCGTCTT TAGTGATGGT 
TGCTGGTACA CGAGCTATGA AAGAGACTAC 
TGATGAAAAC TGTTTTAACT TACGGAACAG 
TGGCTTCCAC AAGCAGGTAA GACAGGTACT 
AATTGAAAAG TATATCAAGA ACACTGGTTA 
TTGTAGGGTA TACCCGTAAA TATGCAATGG 
AATCGTCTAA CTCCAATCAT CGGAGATGGT 
CTATCGTTCA ATGATAACTT ACCTTTCTGA 
GGACAATGCC AGATGGCTTG TATAGAAATG 
GGTGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 



PCT/CAOO/01150 

TGGGATATCT ACAACTCCGA 950 

GCAAGTCGCA TCTACGGTCG 100 0 

AACTTGGTGC TCGTCATCAA 1050 

GCCGTAGAAA CCAATCGTGA 1100 

CTATGCTCCC GCTTTAGAAT 1150 

TACATGATGT CCCTTATAAC 12 00 

TGGGATCATG TCTACTTTGG 12 50 

ATCACGAAAT GTCACAGCCG 13 0 0 

GAGCTAAAAC CTTCCTTAAT 13 50 

TATGCAAACG CCATTTCAAG 14 00 

TGCAAGTAGT GAAAAAATGG 1450 

GTATTTATCA CAAACCAATG 1500 

AGCGAAAAAG AATTTTCTGA 1550 

TGCCTATATG ATGACTGAAA 1600 

GACGTGGAGC CTACCTACCA 1650 

TCTAACTATA CTGACGAAGA 170 0 

CGTAGCTCCA GATGAAATGT 1750 

CTGTTTGGAC AGGATACTCA 1800 

TTCCTTGTTG CTGGTAAAGT 1850 

AGATGACCAA CCTGGAGATT 1900 

GAGAATTCGT ATTTAAAAAT 1950 

CCACAACAAC CCCCATCAAC 200 0 

AACTTCACAG TCTAACTCAA 2050 

CCAATCCTAA CAATAATACG 2100 
AATCAGAATC CTCAACCAGC . 215 0 

2160 



2) INFORMATION FOR SEQ ID NO: 2048 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: M90527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2048 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 10 0 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 350 

TTCCCTCCAA GGTGGATCAG CTCTCACTCA ACAGTTGATT AAGTTGACTT 400 

ACTTTTCAAC TTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAGC AAGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGGAAC TATGGAATGC 550 
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AGACAGCAGC TCAAAACTAC TATGGTAAAG 
CCTCAGTTAG CCTTGCTGGC TGGAATGCCT 
CCCCTATTCA CATCCAGAAG CAGCCCAAGA 
CTGAAATGAA AAATCAAGGC TACATCTCTG 
GTCAATACAC CAATTACTGA TGGGCTACAA 
TTACCCTGCT TACATGGATA ATTACCTCAA 
AAGAAGAAAC AGGCTATAAC CTACTCACAA 
AATGTAGACC AAGAAGCTCA AAAACATCTG 
CGAATACGTT GCCTATCCAG ACGATGAATT 
TTGATGTTTC TAACGGTAAA GTCATTGCCC 
TCAAGTAATG TTTCCTTCGG AATTAACCAA 
CTGGGGATCA ACTATGAAAC CGATCACAGA 
ACGGTGTCTA CGAGTCAACT GCCACTATCG 
TACCCTGGGA CAAATACCCC TGTTTATAAC 
CAACATCACC TTGCAATACG CCCTGCAACA 
TGGAAACTCT AAACAAGGTC GGACTCAACC 
GGTCTCGGAA TCGACTACCC AAGTATTCAC 
TAACACAACC GAATCAGACA AAAT^TATGG 
CTGCTGCTTA CGCTGCCTTT GCAAATGGTG 
TATATCCATA AAGTCGTCTT TAGTGATGGG 
TGTCGGAACT CGTGCCATGA AAGAAACGAC 
TGATGAAAAC AGTCTTGAGT TATGGT^CTG 
TGGCTCCCTC AGGCTGGTAA AACAGGAACC 
AATTGAAAAC CACATCAAGA CCTCTCAATT 
TTGCTGGCTA TACGCGTAAA TATTCAATGG 
AACCGTCTGA CACCACTTGT AGGCAATGGC 
TTACCGCTCT ATGATGACCT ACCTGTCTGA 
GGAATATACC AGAGGGGCTC TACAGAAATG 
GGTGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 
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ACCTCAATAA TTTAAGTTTA 6 00 

CAGGCACCAA ACCAATATGA 6 50 

CCGCCGAAAC TTGGTCTTAT 700 

CTGAACAGTA TGAGAAAGCA 750 

AGTCTCAAAT CAGCAAGTAA 800 

GGAAGTCATC AATCAAGTTG 850 

CTGGGATGGA TGTCTACACA 90 0 

TGGGATATTT ACAATACAGA 950 

GCAAGTCGCT TCTACCATTG 1000 

AGCTAGGAGC ACGCCATCAG 1050 

GCAGTAGAAA CAAACCGCGA 1100 

CTATGCTCCT GCCTTGGAGT 1150 

TTCACGATGA GCCCTATAAC 12 00 

TGGGATAGGG GCTACTTTGG 12 50 

ATCGCGAAAC GTCCCAGCCG 13 00 

GCGCCAAGAC TTTCCTAT^T 13 50 

TACTCAAATG CCATTTCAAG 1400 

AGCAA6TAGT GAAAAGATGG 1450 

GAACTTACTA TAAACCAATG 150 0 

AGTGAAAAAG AGTTCTCTAA 1550 

AGCCTATATG ATGACCGACA 16 00 

GACGAAATGC CTATCTTGCT 1650 

TCTAACTATA CAGACGAGGA 1700 

TGTAGCACCT GATGAACTAT 1750 

CTGTATGGAC AGGCTATTCT 1800 

CTTACGGTCG CTGCCAAAGT 1850 

AGGAAGCAAT CCAGAGGATT 1900 

GAGAATTCGT ATTTAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2 00 0 

AACTTCACAG TCTAGCTCAA 2 050 

CCAATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 2049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: URU-E159 

(C) ACCESSION NUMBER: AF139890 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2049 

ATGAACAAAC CAACGATTCT GCGCCTi\ATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 
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TTCTGAACGC CGCGTCAATG 
TTAAGGCAAT CGTTTCTATC 
ATTGATTCCA TCCGTATCCT 
TTCCCTCCAA GGTGGATCAA 
ACTTTTCAAC CTCGACTTCC 
GCTTGGTTAG CGATTCAGTT 
GACCTACTAT ATAAATAAGG 
AGACAGCAGC TCAAAACTAC 
CCTCAGTTAG CCTTGCTGGC 
CCCCTATTCA CATCCAGAAG 
CTGAAATGAA AAATCAAGGT 
GTCAATACAC CAATTACTGA 
TTACCCTGCT TACATGGATA 
AACAAGAAAC TGGCTATAAC 
AATGTAGACC AAGAAGCTCA 
TCAATACGTC TCTTACCCTG 
TAGATGTTTC AAATGGTAAA 
GCAAGTAACG TTTCATTTGG 
CTGGGGTTCT GCTATGAAAC 
ACGGTGTTTA TGATTCCACT 
TATCCGGGAA CAAGCACACC 
TAATATTACT CTGCAATATG 
TTGAGACTTT GAATAAGGTC 
GGTCTTGGTA TCGACTATCC 
TAATACAACA GAATCTAATA 
CTGCTGCTTA TGCTGCCTTT 
TATATCCATA AAGTCGTCTT 
TGTCGGAACT CGTGCCATGA 
TGATGAAAAC AGTCTTGACT 
TGGCTTCCTC AAGCTGGTAA 
AGTTGAAAAC CACATCAAGA 
TTGTTGGTTA TACTCGTAAG 
AATCGTTTAA CTCCTATCGT 
TTATCGCTCA ATGATAACGT 
GGACGATGCC AGACGGACTT 
GGTGCTCGTT CTACGTGGAA 
TGAAAGTTCA AGCTCATCAT 
CCACTCCAAG CACAAATAAT 
CAACAATCAA ATACAACCCC 
ACAACCATAA 



CCCAAGCTAA TGATATTCCC 
GAAGACCATC GCTTCTTCGA 
GGGAGCTTTC TTGCGCAATC 
CTCTCACCCA ACAGTTGATT 
iGACCAGACTA TTTCTCGTAA 
AGAACAAAAA GCAACCAAAC 
TCTACATGTC TAATGGCAAC 
TATGGTAAAG ACCTCAATAA 
TGGAATGCCT CAGGCACCAA 
CAGCCCAAGA CCGCCGAAAC 
TACATCTCTG CTGAACAGTA 
TGGACTACAA AGTCTCAAAT 
ATTACCTCAA GGAGGTCATC 
CTTCTAACTA CTGGGATGGA 
AAi\ACATCTG TGGGATATCT 
ACGATGATTT GCAAGTCGCA 
GTCATCGCCC AACTTGGAGC 
TACCAACCAA GCTGTGGAAA 
CAATCACCGA TTATGCACCT 
GCAACTATGG TTAATGATAT 
TGTCTACAAC TGGGATAGAG 
CTCTTCAACA ATCACGAAAT 
GGTCTAGATA GAGCTAAAAC 
AAGCATGCAT TATGCAAACG 
AACAATACGG AGCAAGTAGT 
GCAAATGGTG GCACTTACTA 
CAGTGATGGA AGTAAAAAAG 
AGGAAACGAC AGCCTATATG 
TATGGAACTG GGCGTGGAGC 
AACAGGAACC TCTAACTATA 
ACACTGGCTA TGTAGCTCCA 
TATTCTATGG CTGTATGGAC 
TGGAGATGGT TTCCTAGTTG 
ATCTATCAGA AGATACTCAT 
TTCAGAAATG GAGAATTCGT 
CTCACCTGCT CCACAACAAC 
CAGATAGTTC AACTTCACAG 
AGTACGACTA CCAATCCTAA 
TGATCAACAA AATCAGAATC 



ACAGATTTGG 


250 


CCACAGGGGG 


300 


TGCAAAGTAA 


350 


AAGTTGACTT 


400 


GGCTCAGGAA 


450 


AGGAAATCTT 


500 


TATGGAATGC 


550 


TTTAAGTTTA 


600 


ACCAATATGA 


650 


TTGGTCTTAT 


700 


TGAGAAAGCA 


750 


CAGCAAGTAA 


800 


AATCAAGTAG 


850 


TGTTTACACA 


900 


ACAACTCCGA 


950 


TCTACGGTCG 


1000 


TCGTCACCAA 


1050 


CCAATCGTGA 


1100 


GCCATAGAAT 


1150 


TCCTTATAAC 


1200 


CATATTTCGG 


1250 


GTCACAGCCG 


1300 


CTTCCTTAAT 


1350 


CCATTTCAAG 


1400 


GAAAAAATGG 


1450 


TAAACCAATG 


1500 


AGTTCTCTAA 


1550 


ATGACCGACA 


1600 


CTATCTTCCT 


1650 


CAGATGAGGA 


1700 


GATGAAATGT 


1750 


AGGTTATTCG 


1800 


CAGCTAAAGT 


1850 


CCAGAAGACT 


1900 


ATTTAAAAAT 


1950 


CCCCATCAAC 


2000 


TCTAGCTCAA 


2050 


CAATAATACG 


2100 


CTCAACCAGC 


2150 




2160 



2) INFORMATION FOR SEQ ID NO: 2050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 8303; 35193 

(C) ACCESSION NUMBER: AF046230 
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(xi) SEQUENCE DESCRIPTION: SEQ 

CTGTGGGATA TCTACAACTC CGATCAATAC 
TTTGCAAGTC GCATCTACGG TCGTAGATGT 
CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 
CAAGCTGTGG AAACCAATCG TGACTGGGGT 
CGATTATGCA CCTGCCATAG AATACGGTGT 
TGGTTAATGA TATTCCTTAT AACTATCCGG 
AACTGGGATA GAGCATATTT CGGTAATATT 
ACAATCACGA AATGTCACAG CCGTTGAGAC 
ATAGAGCTAA AACCTTCCTT AATGGTCTTG 
CATTATGCAA ACGCCATTTC AAGTAATACA 
CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 
GTGGCACTTA CTATAAACCA ATGTATATCC 
GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 
GACAGCCTAT ATGATGACCG ACATGATGAA 
CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 
ACCTCTAACT ATACAGATGA GGAAGTTGAA 
CTATGTAGCT CCAGATGAAA TGTTTGTTGG 
TGGCTGTATG GACAGGTTAT TCGAATCGTT 
GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 



ID NO: 2050 

GTCTCTTACC CTGACGATGA 50 

TTCAAATGGT AAAGTCATCG 100 

ACGTTTCATT TGGTACCAAC 150 

TCTGCTATGA AACCAATCAC 200 

TTATGATTCC ACTGCAACTA 250 

GAACAAGCAC ACCTGTCTAC 3 00 

ACTCTGCAAT ATGCTCTTCA 350 

TTTGAATAAG GTCGGTCTAG 40 0 

GTATCGACTA TCCAAGCATG 4 50 

ACAGAATCTA ATAAACAATA 500 

TTATGCTGCC TTTGCAAATG 550 

ATAAAGTCGT CTTCAGTGAT 600 

ACTCGTGCCA TGAAGGAAAC 650 

AACAGTCTTG ACTTATGGAA 700 

CTCAAGCTGG TAAAACAGGA 750 

AACCACATCA AGAACACTGG 80 0 

TTATACTCGT AAGTATTCTA 850 

TAACTCCTAT CGTTGGAGAT 900 

930 



2) INFORMATION FOR SEQ ID NO: 2051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 63509; Mil 

(C) ACCESSION NUMBER: AF04 6238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2051 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGATGATGA 50 

CATGCAAGTA GCTTCGACTA TTGTAGATGT ATCTAATGGT AACGTTATTG 100 

CACAACTTGG TGCTCGTCAT CTU^GCAAGTA ATGTTTCATT CGGCACCAAC 150 

CAGGCTGTGG AGACCAATCG TGACTGGGGT TCTTCTATGA AACCAATCAC 200 

TGACTATGCT CCCGCTTTAG AATATGGAGT CTATGACTCT ACTGCTTCTA 250 

TTGTACATGA TGTTCCTTAT AACTATCCTG GCACTGATAC TCCAGTCTAC 3 00 

AACTGGGATC ATGTCTACTT TGGAAACATT ACAATCCAGT ATGCTCTTCA 350 

ACAATCACGA AATGTCACAG CCGTTGAGAC TTTGAATAAG GTCGGTCTAG 400 

ATAGAGCTAA AACCTTCCTT AATGGTCTTG GTATCGACTA TCCAAGCATG 450 

CATTATGCAA ACGCCATTTC AAGTAACACA ACTGAATCCA ACAAACAGTA 500 

CGGTGCAAGT AGTGAAAAGA TGGCTGCTGC TTATGCCGCC TTTTCTAATG 550 

GTGGTATTTA CCACAAACCA ATGTATATCA ATAAAATCGT CTTCAGTGAT 600 

GGTAGTGAAA AAGAATTTTC TGATGCTGGT ACACGAGCTA TGAAAGAAAC 650 

TACTGCCTAT ATGATGACCG AAATGATGAA AACTGTTTTA GCTTACGGAA 700 

CCGGACGTGG TGCTTACCTA CCATGGCTTC CACAAGCAGG TAAGACAGGT 750 
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ACTTCTAACT ATACTGACGA 
CTACGTAGCT CCAGATGAAA 
TGGCTGTATG GACTGGTTAC 
GGTTTCCTAG TTGCTGCTAA 



CGAAATTGAA AAGTATATCA 
TGTTTGTTGG TTATACTCGT 
TCAAATCGTT TAACTCCAAT 
GGTTTATCGC 



PCT/CAOO/01150 

AGAACACTGG 80 0 

AAGTATTCTA 8 50 

CGTAGGAGAT 900 

930 



2) INFORMATION FOR SEQ ID NO: 2052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1195 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniste 

(B) STRAIN: #22/HA5 

(C) ACCESSION NUMBER: AB006877 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2052 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 50 

TTCCAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 100 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 150 

TCAACAATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 200 

ATATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 250 

GAACAAGCAC ACCTGTCTAC AACTGGGATC GAGCATATTT TGGTAATATT 3 00 

ACTCTGCAAT ATGCCCTTCA ACAATCACGT AATGTTACAG CAGTTGAAAC 3 50 

ATTAAACAAG GTTGGTTTAG ATCGAGCCAA AACTTTCCTA AATGGCTTAG 4 00 

GTATCGATTA CCCAAGCATA CACTATGCTA ACGCCATTTC AAGTAATACA 4 50 

ACAGAATCTA ACAAACATTA CGGTGCAAGT AGTGAAAAAA TGGCTGCTGC 50 0 

TTATGCCGCC TTTGCTAATG GTGGTATTTA CCACAAACCA ATGTATATCA 550 

ATAAAATCGT CTTTAGTGAT GGTAGTGAGA AAGAATTTTC TGATGCTGGT 60 0 

ACACGAGCTA TGAAAGAAAC TACTGCCTAT ATGATGACCG AAATGATGAA 650 

AACTGTTTTA GCTTACGGAA CCGGACGTGG TGCTTACCTA CCATGGCTTC 700 

CACAAGCAGG TAAGACAGGT ACTTCTAACT ATACTGATGA CGAAATCGAA 75 0 

AAACATATCA AGAATACTGG CTACGTAGCT CCAGATGAAA TGTTTGTAGG 800 

CTATACTCGT AAGTATTCTA TGGCTGTATG GACTGGTTAC TCAAATCGTT 850 

TAACTCCAAT TGTTGGAGAT GGTTTCCTAG TTGCTGCTAA GGTTTATCGC 900 

TCAATGATAT CGTATCTATC AGAAGATGAC CAACCTGGAG ATTGGACGAT 950 

GCCAGACGGC CTGTTCCGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1000 

GTTCTACGTG GAACTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1050 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1100 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1150 

CT^AATACAAC CCCTGATCAA CAAAATCAGA ATCCTCAACC AGCAC 1195 



2) INFORMATION FOR SEQ ID NO: 2053 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 17619 

(C) ACCESSION NUMBER: AF04 6237 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2053 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 50 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 100 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 15 0 

CAAGCAGTAG AAACAT^CCG CGACTGGGGA TCAACTATGA AACCGATCAC 200 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 250 

TCGTTCACGA TGAGCCCTAT AACTACCCTG GGACAAATAC TCCTGTTTAT 300 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 350 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 4 00 

ACCGCGCCAA GACTTTCCTA AATGGTCTCG GAATCGACTA CCCAAGTATT 450 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 500 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 550 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 600 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 650 

GACAGCCTAT ATGATGACCG AAATGATGAA AACAGTCTTG AGTTATGGAA 700 

CTGGACGAAA TGCCTATCTT GCTTGGCTTC CTCAAGCTGG TAAAACAGGT 750 

ACCTCTAACT ATACAGATGA AGAAATTGAA AAACACATCA AAAACACTGG 800 

CTATGTAGCT CCAGATGAAA CGTTTGTTGG TTATACTCGT AAGTATTCTA 85 0 

TGGCTGTATG GACAGGTTAC ACAAACCGTC TCACACCAAT TGTGGGGGAT 900 

GGCTTCACAG TTGCTGCCAA AGTTTACCGC 930 



2) INFOEIMATION FOR SEQ ID NO: 2054 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: Y07845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2054 

CGTCGCATTC TCTACGGAAT GAATGAATTG GGTGTGACCC CAGACAAACC 50 

CCATAAAAAA TCTGCTCGTA TTACAGGGGA TGTCATGGGT AAATACCACC 10 0 

CACACGGGGA TTCCTCTATT TATGAAGCCA TGGTCCGTAT GGCTCAATGG 150 

TGGAGCTACC GTTACATGCT TGTAGATGGT CATGGGAATT TTGGTTCCAT 200 

GGATGGAGAT AGTGCTGCCG CTCAACGTTA TACCGAGGCA CGTATGAGCA 250 

AGATTGCTCT GGAAATGCTT CGTGATATCA ACAAAAATAC AGTTGATTTC 300 

GTTGAT 306 
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2) INFORMATION FOR SEQ ID NO: 2055 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2472 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 7785 

(C) ACCESSION NUMBER: Z67739 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2055 

ATGTCTAACA TTCAAAACAT GTCCCTGGAG GACATCATGG GAGAGCGCTT 50 

TGGTCGCTAC TCCAAGTACA TTATTCAAGA CCGGGCTTTG CCAGATATTC 100 

GTGATGGGTT GAAGCCGGTT CAGCGCCGTA TTCTTTATTC TATGAATAAG 150 

GATAGCAATA CTTTTGACAA GAGCTACCGT AAGTCGGCCA AGTCAGTCGG 200 

GAACATCATG GGGAATTTCC ACCCACACGG GGATTCTTCT ATCTATGATG 2 50 

CCATGGTTCG TATGTCACAG AACTGGAAAA ATCGTGAGAT TCTAGTTGAA 3 00 

ATGCACGGTA ATAACGGTTC TATGGACGGA GATCCTCCTG CGGCTATGCG 350 

TTATACTGAG GCACGTTTGT CTGAAATTGC AGGCTACCTT CTTCAGGATA 400 

TCGAGAAAAA GACAGTTCCT TTTGCATGGA ACTTTGACGA TACGGAGAAA 4 50 

GAACCAACGG TCTTGCCAGC AGCCTTTCCA AACCTCTTGG TCAATGGTTC 500 

GACTGGGATT TCGGCTGGTT ATGCCACAGA CATTCCTCCC CATAATTTAG 550 

CTGAGGTCAT AGATGCTGCA GTTTACATGA TTGACCACCC AACTGCAAAG 6 00 

ATTGATAAAC TCATGGAATT CTTACCTGGA CCAGACTTCC CTACAGGGGC 6 50 

TATTATTCAG GGTCGTGATG AAATCAAGAA AGCTTATGAG ACTGGGAAAG 700 

GGCGCGTGGT TGTTCGTTCC AAGACTGAAA TTGAAAAGCT AAAAGGTGGT 750 

AAGGAACAAA TCGTTATTAC TGAGATTCCT TATGAAATCA ATAAGGCCAA 8 00 

TCTAGTCAAG AAAATCGATG ATGTTCGTGT TAATAACAAG GTAGCTGGGA 850 

TTGCTGAGGT TCGTGATGAG TCTGACCGTG ATGGTCTTCG TATCGCTATC 900 

GAACTTAAGA AAGACGCTAA TACTGAGCTT GTTCTCAACT ACTTATTTAA 950 

GTACACCGAC CTACAAATCA ACTACAACTT TAATATGGTG GCGATTGACA 1000 

ATTTCACACC TCGTCAGGTT GGGATTGTTC CAATCCTGTC TAGCTACATC 1050 

GCTCACCGTC GAGAAGTGAT TTTGGCGCGT TCACGCTTTG ACAAAGAAAA 1100 

GGCTGAGAAA CGTCTCCATA TCGTCGAAGG TTTGATTCGT GTGATTTCGA 1150 

TTTTGGATGA AGTCATTGCT CTTATCCGTG CTTCTGAGAA TAAGGCGGAC 12 00 

GCCAAGGAAA ACCTCAAAGT TAGCTATGAT TTTACGGAAG AACAGGCTGA 12 50 

GGCTATCGTA ACTTTGCAAC TGTACCGTTT GACCAATACC GATGTGGTTG 13 00 

TCTTGCAGGA AGAAGAAGCA GAGCTTCGTG AGAAGATTGC TATGCTGGCG 13 50 

GCTATTATCG GTGATGAAAG GACTATGTAC AATCTCATGA AGAAAGAACT 14 00 

TCGTGAGGTC AAGAAGAAAT TTGCAACTCC TCGTTTGAGT TCTTTAGAAG 14 50 

ACACTGCGAA AGCAATTGAG ATTGATACAG CTAGTCTTAT CGCTGAGGAA 1500 

GATACCTACG TCAGCGTGAC CAAGGCAGGT TACATCAAGC GTACCAGTCC 1550 

ACGTTCCTTT GCGGCTTCCA CCTTGGAAGA AATTGGCAAG CGTGATGATG 16 00 

ACCGTTTGAT TTTTGTTCAA TCTGCCAAGA CAACCCAGCA CCTCTTGATG 1650 

TTCACAAGTC TTGGAAATGT CATCTACAGA CCAATCCATG AGTTGGCAGA 1700 

TATTCGTTGG AAGGACATCG GAGAGCATCT GAGCCAAACC ATCACAAACT 1750 

TTGAAACGAA TGAAGAAATC CTTTATGTGG AAGTACTGGA TCAGTTTGAC 18 00 

GATGCGACAA CCTACTTTGC AGTGACTCGC CTTGGTCAAA TCAAACGGGT 1850 
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AGAGCGAAAA GAATTCACTC CATGGCGGAC 
AGTATGCTAA GCTCAT^GAC GATACAGATC 
ATTAAACTAG ATGATGTTGT CTTGGTTAGT 
TTTCAATATC GAAGAGGTTC CGGTTGTCGG 
AGGCTATGAA TTTGAAAGAA GATGATGTCC 
AATACTTCGT CCTTCTACCT CTTGACCCAG 
TTCTATTGAG GAAATTCTAG CAACCAGCCG 
TCTTGCGTGA GTTGAAAAAC AAACCGCATC 
GTTGCAGAGC AAGGATTTGT TGGCGATTTC 
GAACGACCAA ACTCTGCTTG TCCAATCCAA 
GCCGATTGCA AGACTTGAAC TTGTCAGAAC 
ATTTCTGACA CGATTTCAGA TGAAGAAGTT 
AGTAGTTACT GAAGATAAAT AA 



PCT/CAOO/01150 

CTATAGATCT AAGTCTGTCA 1900 

AGATTGTAGC AGTGGCTCCG 1950 

CAAAATGGTT ATGCCCTGCG 2000 

TGCTAAGGCA GCAGGTGTCA 2050 

TCCAATCTGG CTTTATCTGT 2100 

CGTGGAAGCT TGAAACGTGT 2150 

TGCCAAACGA GGATTACAAG 22 00 

GTGTCTTCTT GGCAGGAGCA 2250 

TTCAGTACGG 7AGTGGATGT 2300 

TAAAGGAACA ATCTATGAAA 23 50 

GCACTAGCAA TGGAAGCTTC 2400 

TTTGACGCTT ATCTTCAGGA 2450 

2472 



2) INFORMATION FOR SEQ ID NO: 2056 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumonicie 

(B) STRAIN: StrR-16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2056 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAT^ 100 

TGGTAAAGTC ATTGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGCAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCAACA 20 0 

ATGAAACCAA TCACAGACTA TGCTCCTGCC TTGGAATACG GTGTCTACGA 250 

TTCAACTGCT ACTATCGTTC ACGATGAGCC CTATAACTAT CCTGGGACAG 300 

ATACCCCTGT CTATAACTGG GATAGGGGCT ACTTTGGCAA TATCACCTTG 3 50 

CAATACGCCC TGCAACAATC GCGAAACGTC CCAGCCGTGG AAACACTAAA 400 

CAAGGTCGGG CTCAACCGCG CCAAGACTTT TCTAAATGGT CTCGGAATCG 450 

ACTACCCAAG TATTCACTAC TCAAATGCCA TTTCAAGTAA CACAACCGAG 500 

TCAGACAAAA AATATGGAGC AAGTAGTGAA AAGATGGCTG CTGCTTACGC 550 

TGCCTTTGCA AATGGTGGAA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

TCGTCTTTAG TGATGGAAGT GAAAAAGAGT TCTCTAATGT CGGAACTCGC 650 

GCCATGAAAG AAACGACTGC TTACATGATG ACAGAAATGA TGAAAACAGT 700 

CTTGACGTAC GGAATTGGTC GTGGTGCCTA CCTGCCTTGG CTTCCTCAAG 750 

CTGGTAT^AAC AGGTACTTCT AACTATACTG ACGAAGAAAT TGAAAAGTAT 800 

ATCAAGAACA CTGGTTACGT AACTCCAGAT GAAATGTTTG TAGGGTATAC 850 

CCGTAT^TAT GCAATGGCTG TATGGACAGG CTATTCTAAC CGTCTGACAC 900 

CACTTGTAGG CGATGGCCTT ACGGTCGCTG CCAAGGTTTA CCGCTCTATG 950 

ATGACCTACC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA TC 1212 
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2) INFORMATION FOR SEQ ID NO: 2057 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-17 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2057 

ATGTAGACCA AGAAGCTCAA AAACATCTGT GGGATATTTA CAATACAGAC 50 

GAATACGTTG CCTATCCAGA CGATGAATTG CAAGTCGCTT CTACCATTGT 100 

TGATGTTTCT AACGGGAAAG TTATTGCTCA GTTAGGTTCT CGTCACCAAT 150 

CAAGCAATGT TTCCTTCGGA ATCAACCAAG CTGTTGAAAC CAACCGTGAC 200 

TGGGGTTCTG CCATGAAGCC AATCACAGAC TATGCTCCTG CCTTAGAGTA 250 

TGACATCTAC GACTCAACTG CTTCGATTGT ACATGATGTT CCTTATAACT 3 00 

ATCCAGGTAC TGATACTCCC CTCTACAACT GGGATAAAGT CTACTTTGGA 350 

AATATTACAA TCCAGTATGC ACTTCAACAG TCACGTAATG TCACAGCCGT 400 

TGAGACTTTG AATAAGGTCG GTCTAGATAG AGCTAAAACC TTTCTTAATG 4 50 

GTCTTGGTAT CGACTATCCA AGCATGCATT ATGCAAACGC CATTTCAAGT 5 00 

AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG AGAAAATGGC 550 

TGCTGCTTAC GCTGCTTTTG CTAACGGTGG TATCTACCAT AAACCAATGT 600 

ATATCAACAA AATCGTCTTT AGCGATGGTA GCTCAAAAGA ATACGCTGAT 650 

CCTGGTACTC GTGCCATGAA AGAGACGACC GCCTATATGA TGACAGAAAT 700 

GATGAAGACT GTCTTGGCAT ACGGAACGGG TCGTGGTGCT TATCTCCCTT 750 

GGCTACCTCA AGCTGGTAAG ACTGGTACAT CAAACTATAC AGATGATGAA 800 

ATTGAAAACT ACATCAAAAA TACTGGTTAT GTAGCTCCAG ACGAAATGTT 850 

TGTTGGTTAT ACTCGCAAAT ATTCAATGGC GGTATGGACA GGTTACTCAA 900 

ACCGCCTGAC TCCTATCGTT GGTGATGGCT TCTATGTTGC AGCTAAGGTT 950 

TACCGTTCAA TGATGACTTA TCTGTCTGAG GATAACAACC CTGGCGACTG 1000 

GACTATGCCA GAAGGTCTCT ATCGAAGTGG TGAGTTCGTC TTTAAAAAAG 1050 

GTGCTCGTTC TGCATGGACT GCTCCTGCTC CGCAACAGGC CCCAACACCA 1100 

GAAAGTTCGA GCTCGACATC AGAAAGTTCA ACTTCACAGT CAAGCTCAAC 1150 

TACTCCAAGC ACGAATAATA GTGCAAACAA TAATACCAAT AACCAGCAAC 1200 

CAAATACAAC GCCTGGTCAA CAAAACCAGA ACCAAAATCA GA 1242 



2) INFORMATION FOR SEQ ID NO: 2058 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2058 

GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG AATACGTTGC 5 0 

CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT GATGTTTCTA 100 

ACGGGT^AAGT TATTGCTCAG TTAGGTTCTC GTCACCAATC AAGCAATGTT 150 

TCCTTCGGAA TCAACCAAGC TGTTGAAACC AACCGTGACT GGGGTTCTGC 2 00 

CATGAAGCCA ATCACAGACT ATGCTCCTGC CTTAGAGTAT GACATCTACG 250 

ACTCAACTGC TTCGATTGTA CATGATGTTC CTTATAACTA TCCAGGTACT 300 

GATACTCCCC TCTACAACTG GGATAAAGTC TACTTTGGAA ATATTACAAT 350 

CCAGTATGCA CTTCAACAGT CACGTAATGT CACAGCCGTT GAGACTTTGA 400 

ATAAGGTCGG TCTAGATAGA GCTAAAACCT TTCTTAATGG TCTTGGTATC 450 

GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA ATACGACTGA 50 0 

GTCAAACAAA AAGTACGGAG CAAGTAGTGA GAAAATGGCT GCTGCTTACG 550 

CTGCTTTTGC TAACGGTGGT ATCTACCATA AACCAATGTA TATCAACAAA 600 

ATCGTCTTTA GCGATGGTAG CTCAAAAGAA TACGCTGATC CTGGTACTCG 650 

TGCCATGAAA GAGACGACCG CCTATATGAT GACAGAAATG ATGAAGACTG 700 

TCTTGGCATA CGGAACGGGT CGTGGTGCTT ATCTCCCTTG GCTACCTCAA 750 

GCTGGTAAGA CTGGTACATC AAACTATACA GATGATGAAA TTGAAAACTA 800 

CATCAAAAAT ACTGGTTATG TAGCTCCAGA CGAAATGTTT GTTGGTTATA 850 

CTCGCAAATA TTCAATGGCG GTATGGACAG GTTACTCAAA CCGCCTGACT 900 

CCTATCGTTG GTGATGGCTT CTATGTTGCA GCTAAGGTTT ACCGTTCAAT 950 

GATGACTTAT CTGTCTGAGG ATAACAACCC TGGCGACTGG ACTATGCCAG 1000 

AAGGTCTCTA TCGAAGTGGT GAGTTCGTCT TTAAAAAAGG TGCTCGTTCT 1050 

GCATGGACTG CTCCTGCTCC GCAACAGGCC CCAACACCAG AAAGTTCGAG 1100 

CTCGACATCA GAAAGTTCAA CTTCACAGTC AAGCTCAACT ACTCCAAGCA 1150 

CGAATAATAG TGCAAACAAT AATACCAATA ACCAGCAACC AAATACAACG 1200 

CCTGGTCAAC AAAACCAGAA CCAAA 1225 



2) INFORMATION FOR SEQ ID NO: 2059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2059 

TGGCTACTTT GGAAACATTA CAGTCCAGTA TGCTCTTCAA CAATCACGAA 50 

ATGTCACAGC CGTTGAGACT TTGAATAAGG TCGGTCTAGA TAGAGCTAAA 100 

ACCTTCCTCA ATGGACTTGG TATCGATTAT CCAAGCATGC TTTATGCAAA 150 

CGCCATTTCA AGTAACACAA CTGAATCCAA CAAAAAGTAC GGAGCAAGTA 200 

GTGAAAAAAT GGCCGCTGCC TACGCAGCTT TTGCTAATGG TGGTACTTAC 250 

CACAAACCAA TGTATATCAA TAAAATCGTC TTTAGTGATG GTAGTGAAAA 3 00 

AGAATTTTCT GATGCCGGTA CTCGGGCTAT GAAAGAAACT ACTGCCTATA 350 
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TGATGACCGA AATGATGAAA ACTGTCTTAT 
GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 
TACTGACGAA GAAATTGAAA AGTATATCAA 
CAGATGAAAT GTTTGTTGGT TATACCCGCA 
ACAG 



TATACGGAAC CGGACGTGGA 4 00 

AAGACAGGTA CTTCTAACTA 4 50 

GAATGCTGGT TACGTAGCTC . 500 
AATATGCAAT GGCTGTTTGG 550 

554 



2) INFORMATION FOR SEQ ID NO: 2060 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-57 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2060 

CAACTGGGAT GGATGTCTAC ACAAATGTAG ACCAAGAAGC TCAAAAACAT 50 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 100 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 150 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 2 00 

CAAGCAGTAG AAACAAACCG CGACTGGGGA TCAACTATGA AACCGATCAC 250 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 3 00 

TCGTTCACGA TGAGCCCTAT AACTACCCTG GGACAAATAC TCCTGTTTAT 3 50 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 4 00 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 4 50 

ACCGCGCCAA GACTTTCCTA AATGGTCTAG GAATCGACTA CCCAAGTATT 500 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 550 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 600 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 650 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 700 

GACAGCCTAT ATGATGACCG ACATGATGAA AACAGTCTTG ACTTATGGAA 750 

CTGGACGAAA TGCCTATCTT GCTTGGCTCC CTCAGGCTGG TAAAACAGGA 800 

ACCTCTAACT ATACAGACGA GGAAATTGAA AACCACATCA AGACCTCTCA 850 

ATTTGTAGCA CCTGATGAAC TATTTGCTGG CTATACGCGT AAATATTCAA 900 

TGGCTGTATG GACAGGCTAT TCTAACCGTC TGACACCACT TGTAGGCAAT 950 

GGCCTTACGG TCGCTGCCAA AGTTTACCGC TCTATGATGA CCTACCTGTC 1000 

TGAAGGAAGC AATCCAGAAG ATTGGAATAT ACCAGAGGGG CTCTACAGAA 1050 

ATGGAGAATT CGTATTTAAA AATGGTGCTC GTTCTACGTG GAGCTCACCT 1100 

GCTCCACAAC AACCCCCATC AACTGAAAGT TCAAGCTCAT CATCAGATAG 1150 

TTCAACTTCA CAGTCTAGCT CAACCACTCC AAGCACAAAT AATAGTACGA 1200 

CTACCAATCC TAACAATAAT ACGCAACAAT CAAATACAAC CCCTGATCA 1249 



2) INFORMATION FOR SEQ ID NO: 2061 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: StrGptococcus pneumoniae 

(B) STRAIN: StrR-60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2061 

GAAGTTGAAC AGACTGGTCA CCAAGCTCCT TCATATCCGC CAACACTGCA 50 

ATTTTCTTGC CACCTTCATT GGCTGGAATG GCAGAGAAAG TCTCTAAAAT 100 

CAGTTTCATA GCAGTTGGAT TGGCATTATA AACATCTGAC AGGATATCTG 150 

CTCCATTGGC TGCTTTCTTC CACTCGGTAC GGTTACGCGT CAATTCAAGA 200 

TGTTGGAAGG CCAAACGAAT TTGCTCCTCT GAAACTCCTT CTTGCAAGGC 2 50 

AACATAGGAT GCAATCATAG CATTTGTCGC ATTGTACTTG CCAGTTACTG 300 

GCAAATCAAG GGCTTGCTCT AAGAAATTGA CCTTGAAGGT CAGACTATCT 350 

TTGCGCTCAA CCAAGTCGGT AATTCCCAGC TCTGCTCCTT GACCAAAACG 4 00 

AACCACCTTT TTATCAGTTG GCAAATAGTC CTCTACGATA GGGTCAGCCG 450 

GCGCTAAAAG CAAGGAACCT GAAGCCATTC CGTCTGCAAT TTGCATTTTT 500 

CCTTTAGCAA TCTCAGAACG GTCTTTGAAA AAGGCCAAAT GAGCTTCTCC 550 

AACCAAGGTC ACGATGGCTG TATGGACAG 579 



2) INFORMATION FOR SEQ ID NO: 2062 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-63 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2062 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA AGTAATGTTT 150 

CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG GGGATCATCA 200 

ATGAAACCAA TCACTGACTA TGCTCCCGCT TTAGAATATG GAGTCTATGA 250 

CTCTACTGCT TCTATTGTAC ATGATGTCCC TTATAACTAT CCTGGCACTG 300 

ATACTCCACT CTACAACTGG GATCATGTCT ACTTTGGAAA CATTACAATC 350 

CAGTATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA CACAACTGAA 500 

TCCAACAAAA AATATGGTGC AAGTAGTGAA AAAATGGCTG CTGCCTACGC 550 

TGCTTTTGCT AATGGTGGTA TTTATCACAA ACCAATGTAT ATCAATAAAA 6 00 

TCGTCTTTAG TGATGGTAGC GAAAAAGAAT TTTCTGATGC TGGTACACGA 650 

GCTATGAAAG AGACTACTGC CTATATGATG ACTGAAATGA TGAAT^CTGT 700 

TTTAACTTAC GGAACAGGAC GTGGAGCCTA CCTACCATGG CTTCCACAAG 750 
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CAGGTAAGAC AGGTACTTCT 
ATCAAGAACA CTGGTTACGT 
CCGTAAATAT GCAATGGCTG 
CAATCATCGG AGATGGTTTC 
ATAACTTACC TTTCTGAAGA 
TGGCTTGTAT AGAAATGGAG 
CGTGGAGCTC ACCTGCTCCA 
TCATCATCAG ATAGTTCAAC 
AAATAATAGT ACGACTACCA 
CAACCCCTGA TCAACA 



AACTATACTG ACGAAGAAAT TGAAAAGTAT 
AGCTCCAGAT GAAATGTTTG TAGGGTATAC 
TTTGGACAGG ATACTCAT^T CGTCTAACTC 
CTTGTTGCTG GTAAAGTCTA TCGTTCAATG 
TGACCAACCT GGAGATTGGA CAATGCCAGA 
AATTCGTATT TAAAAATGGT GCTCGTTCTA 
CAACAACCCC CATCAACTGA AAGTTCAAGC 
TTCACAGTCT AGCTCAACCA CTCCAAGCAC 
ATCCTAACAA TAATACGCAA CAATCAAATA 



800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1216 



2) INFORMATION FOR SEQ ID NO: 2063 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2063 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC CACGGTCGTA 50 

GATGTTTCAA ATGGTAAAGT CATCGCCCAA CTTGGAGCTC GTCACCAAGC 100 

AAGTAACGTT TCATTTGGTA CCAACCAAGC TGTGGAAACC AATCGTGACT 150 

GGGGTTCAAC AATGAAACCC ATCACCGATT ATGCACCTGC CATAGAATAC 200 

GGTGTATATG ATTCCACTGC AACTATGGTT AATGATATTC CTTATAACTA 250 

TCCGGGAACA AGCACACCTG TCTACAACTG GGATCGAGCA TATTTTGGTA 3 00 

ATATTACTCT GCAATATGCC CTTCAACAAT CTCGTAACGT ACCCGCCGTT 350 

GAGACACTAA ACAAGGTTGG TTTAGATAGA GCTAAAAGTT TCCTAAATGG 4 00 

TTTAGGAATC GACTATCCTG TAATGCACTA TTCAAATGCT ATTTCAAGTA 450 

ATACTACCGA ATCTAGTAAA CAGTACGGGG CAAGTAGTGA AAAAATGGCC 500 

ACTGCCTATG CCGCATTCGC AAACGGCGGT ATTTACCACA AACCAATGTA 550 

CATCAATAAG GTTGTCTTTA GCGATGGTAG CGAAAAAGAA TTTTCTGACC 600 

CTGGCACAAG AGCCATGAAA GAAACGACTG CTTACATGAT GACAGAGATG 6 50 

ATGAAAACAG TCTGGACTTA CGGAACTGGT CGCGGTGCCT ACCTACCTTG 700 

GCTTCCACAA GCAGGTAAAA CAGGTACTTC TAACTATACT GACGAAGAAA 750 

TTGAAAAGTA TATCAAGAAC ACTGGTTACG TAGCTCCAQA TGAAATGTTT 80 0 

GTAGGGTATA 810 



2) INFORMATION FOR SEQ ID NO: 2064 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700678 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2064 

TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG TAGATGTTTC 50 

AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA GCAAGTAACG 100 

TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA CTGGGGATCA 150 

ACTATGAAAC CAATCACAGA CTATGCTCCT GCCTTGGAGT ACGGTGTCTA 2 00 

CGATTCAACT GCTACTATCG TTCACGATGA GCCCTATAAC TACCCTGGGA 250 

CAGATATCCC TCTCTATAAC TGGGATCGAG CATATTTCGG TAATATTACT 3 00 

CTGCAATATG CCCTTCAACA ATCTCGTAAC GTACCTGCCG TTGAAACACT 350 

AAACAAGGTC GGTCTAGATA AGGCTAAAAC CTTCCTTAAT GGTCTTGGTA 4 00 

TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG TAATACAACT 450 

GAATCCAACA AAAAATATGG TGCAAGTAGT GAAAT^AATAG CTACCGCCTA 500 

TGCCGCATTC GCAAATGGTG GTATTTACCA CAAACCAATG TACATCAATA 550 

AAGTTGTCTT TAGCGATGGT AGCGAAAAAG AATTTTCTGA CCCTGGCACA 600 

AGAGCCATGA AAGAAACGAC TGCTTACATG ATGACAGAAA TGATGAAAAC 650 

AGTCTGGACG TACGGAACTG GTCGTGGTGC CTACCTGCCT TGGCTTCCTC 700 

AAGCTGGTAA AACAGGTACC TCTAACTATA CTGACGT^GA AATTGAAAAG 750 

TATATCAAGA ACACTGGTTA CGTAGCTCCA GA 782 



2) INFORMATION FOR SEQ ID NO: 2065 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2065 
CCAGGACGTG GAGGCGATCA CA 22 



2) INFORMATION FOR SEQ ID NO: 2066 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2066 
CACCGACAGC GAGCCGATCA GA 22 
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2) INFORMATION FOR SEQ ID NO: 2067 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2067 
AGCTGAGCCA ATTCATGG 



2) INFORMATION FOR SEQ ID NO: 2068 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 
ATTCATiSGAC CAGAACAAC 



2) INFORMATION FOR SEQ ID NO: 2069 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2069 
CGCTGTCGGG GTTGACCC 



2) INFORMATION FOR SEQ ID NO: 2070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2070 
GTTGACCCAC AAGCGCCG 



2) INFORMATION FOR SEQ ID NO: 2071 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2071 
CGACTGTCGG CGCTGGGG 



2) INFORMATION FOR SEQ ID NO: 2072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Rv 

(C) ACCESSION NUMBER: L27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2072 

GTGCTGGAAG GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC 50 

TAGTCCTAGT CCGAGTCGCC CGCAAAGTTC CTCGAATAAC TCCGTACCCG 100 

GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC TGCGCGAACC ACTTGAGGTT 150 

CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC TGATCGGTTC 200 

GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 250 

GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 3 00 

GGGTCGATGT CGTTGTCGTT CTCTGACCCT CGTTTCGACG ATGTCAAGGC 350 

ACCCGTCGAC GAGTGCAAAG ACAAGGACAT GACGTACGCG GCTCCACTGT 400 

TCGTCACCGC CGAGTTCATC AACAACAACA CCGGTGAGAT CAAGAGTCAG 450 

ACGGTGTTCA TGGGTGACTT CCCGATGATG ACCGAGAAGG GCACGTTCAT 500 

CATCAACGGG ACCGAGCGTG TGGTGGTCAG CCAGCTGGTG CGGTCGCCCG 550 

GGGTGTACTT CGACGAGACC ATTGACAAGT CCACCQACAA GACGCTGCAC 600 

AGCGTCAAGG TGATCCCGAG CCGCGGCGCG TGGCTCGAGT TTGACGTCGA' 650 

CAAGCGCGAC ACCGTCGGCG TGCGCATCGA CCGCAAACGC CGGCAACCGG 700 
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TCACCGTGCT GCTCAAGGCG CTGGGCTGGA CCAGCGAGCA GATTGTCGAG 750 

CGGTTCGGGT TCTCCGAGAT CATGCGATCG ACGCTGGAGA AGGACAACAC 800 

CGTCGGCACC GACGAGGCGC TGTTGGACAT CTACCGCAAG CTGCGTCCGG 850 

GCGAGCCCCC GACCAAAGAG TCAGCGCAGA CGCTGTTGGA AAACTTGTTC 900 

TTCAAGGAGA AGCGCTACGA CCTGGCCCGC GTCGGTCGCT ATAAGGTCAA 950 

CAAGAAGCTC GGGCTGCATG TCGGCGAGCC CATCACGTCG TCGACGCTGA 1000 

CCGAAGAAGA CGTCGTGGCC ACCATCGAAT ATCTGGTCCG CTTGCACGAG 1050 

GGTCAGACCA CGATGACCGT TCCGGGCGGC GTCGAGGTGC CGGTGGAAAC 1100 

CGACGACATC GACCACTTCG GCAACCGCCG CCTGCGTACG GTCGGCGAGC 1150 

TGATCCAAAA CCAGATCCGG GTCGGCATGT CGCGGATGGA GCGGGTGGTC 1200 

CGGGAGCGGA TGACCACCCA GGACGTGGAG GCGATCACAC CGCAGACGTT 1250 

GATCAACATC CGGCCGGTGG TCGCCGCGAT CAAGGAGTTC TTCGGCACCA 1300 

GCCAGCTGAG CCAATTCATG GACCAGAACA ACCCGCTGTC GGGGTTGACC 1350 

CACAAGCGCC GACTGTCGGC GCTGGGGCCC GGCGGTCTGT CACGTGAGCG 1400 

TGCCGGGCTG GAGGTCCGCG ACGTGCACCC GTCGCACTAC GGCCGGATGT 14 50 

GCCCGATCGA AACCCCTGAG GGGCCCAACA TCGGTCTGAT CGGCTCGCTG 1500 

TCGGTGTACG CGCGGGTCAA CCCGTTCGGG TTCATCGAAA CGCCGTACCG 1550 

CAAGGTGGTC GACGGCGTGG TTAGCGACGA GATCGTGTAC CTGACCGCCG 1600 

ACGAGGAGGA CCGCCACGTG GTGGCACAGG CCAATTCGCC GATCGATGCG 1650 

GACGGTCGCT TCGTCGAGCC GCGCGTGCTG GTCCGCCGCA AGGCGGGCGA 1700 

GGTGGAGTAC GTGCCCTCGT CTGAGGTGGA CTACATGGAC GTCTCGCCCC 1750 

GCCAGATGGT GTCGGTGGCC ACCGCGATGA TTCCCTTCCT GGAGCACGAC 18 0 0 

GACGCCAACC GTGCCCTCAT GGGGGCAAAC ATGCAGCGCC AGGCGGTGCC 1850 

GCTGGTCCGT AGCGAGGCCC CGCTGGTGGG CACCGGGATG GAGCTGCGCG 1900 

CGGCGATCGA CGCGGCGACG TCGTCGTCGC AAGAAAGCGG CGTCATCGAG 1950 

GAGGTGTCGG CCGACTACAT CACTGTGATG CACGACAACG GCACCCGGCG 2000 

TACCTACCGG ATGCGCAAGT TTGCCCGGTC CAACCACGGC ACTTGCGCCA 2050 

ACCAGTGCCC CATCGTGGAC GCGGGCGACC GAGTCGAGGC CGGTCAGGTG 2100 

ATCGCCGACG GTCCCTGTAC TGACGACGGC GAGATGGCGC TGGGCAAGAA 2150 

CCTGCTGGTG GCCATCATGC CGTGGGAGGG CCACAACTAC GAGGACGCGA 2200 

TCATCCTGTC CAACCGCCTG GTCGAAGAGG ACGTGCTCAC CTCGATCCAC 2250 

ATCGAGGAGC ATGAGATCGA TGCTCGCGAC ACCAAGCTGG GTGCGGAGGA 23 00 

GATCACCCGC GACATCCCGA ACATCTCCGA CGAGGTGCTC GCCGACCTGG 2350 

ATGAGCGGGG CATCGTGCGC ATCGGTGCCG AGGTTCGCGA CGGGGACATC 24 00 

CTGGTCGGCA AGGTCACCCC GAAGGGTGAG ACCGAGCTGA CGCCGGAGGA 24 50 

GCGGCTGCTG CGTGCCATCT TCGGTGAGAA GGCCCGCGAG GTGCGCGACA 2500 

CTTCGCTGAA GGTGCCGCAC GGCGAATCCG GCAAGGTGAT CGGCATTCGG 2550 

GTGTTTTCCC GCGAGGACGA GGACGAGTTG CCGGCCGGTG TCAACGAGCT 260 0 

GGTGCGTGTG TATGTGGCTC AGAAACGCAA GATCTCCGAC GGTGACAAGC 2 65 0 

TGGCCGGCCG GCACGGCAAC AAGGGCGTGA TCGGCAAGAT CCTGCCGGTT 2700 

GAGGACATGC CGTTCCTTGC CGACGGCACC CCGGTGGACA TTATTTTGAA 2750 

CACCCACGGC GTGCCGCGAC GGATGAACAT CGGCCAGATT TTGGAGACCC 280 0 

ACCTGGGTTG GTGTGCCCAC AGCGGCTGGA AGGTCGACGC CGCCAAGGGG 2850 

GTTCCGGACT GGGCCGCCAG GCTGCCCGAC GAACTGCTCG AGGCGCATGC 2900 

GAACGCCATT GTGTCGACGC CGGTGTTCGA CGGCGCCCAG GAGGCCGAGC 2 950 

TGCAGGGCCT GTTGTCGTGC ACGCTGCCCA ACCGCGACGG TGACGTGCTG 3000 

GTCGACGCCG ACGGCAAGGC CATGCTCTTC GACGGGCGCA GCGGCGAGCC 3050 

GTTCCCGTAC CCGGTCACGG TTGGCTACAT GTACATCATG AAGCTGCACC 3100 

ACCTGGTGGA CGACAAGATC CACGCCCGCT CCACCGGGCC GTACTCGATG 3150 

ATCACCCAGC AGCCGCTGGG CGGTAAGGCG CAGTTCGGTG GCCAGCGGTT 3200 

CGGGGAGATG GAGTGCTGGG CCATGCAGGC CTACGGTGCT GCCTACACCC 3250 

TGCAGGAGCT GTTGACCATC AAGTCCGATG ACACCGTCGG CCGCGTCAAG 33 00 

GTGTACGAGG CGATCGTCAA GGGTGAGAAC ATCCCGGAGC CGGGCATCCC 3350 

CGAGTCGTTC AAGGTGCTGC TCAAAGAACT GCAGTCGCTG TGCCTCAACG 3400 

TCGAGGTGCT ATCGAGTGAC GGTGCGGCGA TCGAACTGCG CGAAGGTGAG 3450 

GACGAGGACC TGGAGCGGGC CGCGGCCAAC CTGGGAATCA ATCTGTCCCG 3500 

CAACGAATCC GCAAGTTTCG AGGATCTTGC GTAA 3534 
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2) INFORMATION FOR SEQ ID NO: 2073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2073 
CCGAGCAACA TGATTGAACC ATCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2074 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2074 
CCGAGCAACA TGATTGAAGC TTCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2075 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2075 
CCGAGCCAGG TTCTGAAGTC TCTGCATTAT TAGGTGCTCG G 41 



2) INFORMATION FOR SEQ ID NO: 2076 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 

CCGAGCYGAY AACATTTTCA GATTCACCCA 
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V. 

ID NO: 2076 

RGCGCTCGG 39 



2) INFORMATION FOR SEQ ID NO: 2077 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2077 
CCGAGCAACC GATCCAGCTC CAGCTACGCT CGG 33 



2) INFORMATION FOR SEQ ID NO: 2078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2078 
CCGAGCCTTG GTCTTCGGCC AAATGAACGC TCGG 34 



2) INFORMATION FOR SEQ ID NO: 2079 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2079 
CCGAGCGTTC AGTTACTTCA GTCCAAGCCG GCTCGG 36 
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2) INFORMATION FOR SEQ ID NO: 2080 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2080 
CCGAGCCGAA GAGGGCCAAG ATGTCGCTCG G 



2) INFORMATION FOR SEQ ID NO: 2081 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2081 
GRATYRTYAA AGTTGGTGAG GAAG 



2) INFORMATION FOR SEQ ID NO: 2082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2082 
CMACTTCATC YCGCTTCGTA CC 



2) INFORMATION FOR SEQ ID NO: 2083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2083 
CCGCCGATGT TCCGTAAATT ACTTGAIGAA GGTCGAGCCG GCGG 44 

2) INFORMATION FOR SEQ ID NO: 2084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2084 
CACGCGTCAA CACCCGTACA AGTCGTCTTT TGCGCGTG 38 

2) INFORMATION FOR SEQ ID NO: 2085 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2085 
CAAACTAAAG T^CATATCTT GCTA 24 

2) INFORMATION FOR SEQ ID NO: 2086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2086 
ATATAATTTG CATCACCTTC AAG 23 
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2) INFORMATION FOR SEQ ID NO: 2087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2087 
TCAGCTCGTG GGATTAGGAG AG 



2) INFORMATION FOR SEQ ID NO: 2088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2088 
AGGCTTCACG CTGTTAGGCT GA 



2) INFORMATION FOR SEQ ID NO: 2089 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2089 
ATGCTGAACT TATTGACCTT 



2) INFORMATION FOR SEQ ID NO: 2090 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2090 
CGTTACTGGA GTCGAAATG 



2) INFORMATION FOR SEQ ID NO: 2091 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2091 
CGCGACTTGA GATGGAACTT AGTGAGCTTC TTGGTCGCG 



2) INFORMATION FOR SEQ ID NO: 2092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2092 
CGCGACGAAA GAACTTCCTG AAGGTCGTGC AGGTCCAG 



2) INFORMATION FOR SEQ ID NO: 2093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2093 
TGTTGGCAAT CGAAGACACC 
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2) INFORMATION FOR SEQ ID NO: 2094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2094 

TTCAATTTCT TGACCTACTT TCAA 

2) INFORMATION FOR SEQ ID NO: 2095 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2095 
CGGTCGGGTT GAACGTGG 



2) INFORMATION FOR SEQ ID NO: 2096 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2096 
CGCGACCGGT ACCACGGCCA GTAATCGTGT CGCG 
2) INFORMATION FOR SEQ ID NO: 2097 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Mycoplasma pneumoniae 

(B) STRAIN: ATCC 29342 

(C) ACCESSION NUMBER: AE000019 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2097 

ATGGCAAGAG AGAAATTTGA CCGATCTAAA CCCCACGTTA ATGTAGGTAC 50 

TATTGGCCAC ATTGACCACG GTAAAACAAC TTTAACAGCA GCTATTTGTA 100 

CTGTATTAGC AAAAGAAGGT AAATCAGCTG CTACTCGTTA CGACCAAATC 150 

GATAAGGCTC CGGAAGAAAA AGCACGGGGA ATTACGATTA ACTCCGCTCA 2 00 

CGTGGAGTAC TCCTCTGACA AGCGTCACTA TGCTCACGTT GACTGTCCAG 250 

GACACGCTGA CTACATTAAG AACATGATTA CTGGTGCTGC ACAAATGGAT 3 00 

GGTGCCATTC TAGTAGTTTC AGCAACTGAC AGTGTTATGC CCCAAACCCG 3 50 

TGAACACATT TTGTTGGCCC GCCAAGTGGG TGTGCCACGC ATGGTAGTGT 4 00 

TCCTAAACAA GTGTGACATT GCAACTGATG AAGAAGTGCA AGAGTTAGTA 450 

GCAGAAGAGG TACGTGACTT ATTAACTTCT TACGGCTTTG ATGGCAAGAA 500 

CACCCCTATT ATTTATGGTT CTGCACTTAA AGCGCTTGAA GGTGATCCTA 550 

AGTGGGAAGC TAAGATCCAT GATTTAATGA ATGCAGTTGA TGAATGGATT 600 

CCAACTCCTG AACGTGAAGT GGACAAACCC TTCTTGTTGG CAATCGAAGA 650 

CACCATGACG ATTACTGGCC GTGGTACCGT GGTTACCGGT CGGGTTGAAC 700 

GTGGTGAATT GAAAGTAGGT CAAGAAATTG A7y\TCGTTGG TTTACGTCCA 750 

ATCCGTAAAG CAGTTGTTAC CGGAATCGAA ATGTTCAAAA AGGAACTTGA 800 

TTCAGCAATG GCTGGGGACA ACGCTGGGGT ATTACTCCGT GGTGTGGACC 850 

GTAAAGAAGT GGAACGTGGT CAAGTGTTAG CTAAACCAGG TTCGATTAAA 900 

CCGCACAAGA AATTTAAAGC GGAAATCTAT GCTTTAAAGA AGGAAGAAGG 950 

TGGTCGTCAC ACCGGTTTCT TAAACGGTTA CCGTCCCCAA TTCTACTTCC 1000 

GTACTACAGA CGTTACTGGT TCGATTTCCC TACCAGAAAA CACCGAAATG 1050 

GTGCTACCAG GTGACAATAC CTCGATTACA GTTGAACTAA TTGCACCAAT 1100 

TGCTTGTGAA AAAGGTAGTA AGTTCTCCAT CCGTGAAGGT GGTCGAACGG 1150 

TTGGTGCTGG TTCAGTCACG GAAGTGCTTG AATAG 1185 



2) INFORMATION FOR SEQ ID NO: 2098 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2098 
CTGAGTCACA CCGACAAACG TC 



2) INFORMATION FOR SEQ ID NO: 2099 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2099 
CCAGGACTGA ACGGGATACG AA 

2) INFORMATION FOR SEQ ID NO: 2100 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2100 
GCGAGACGAT AGGTTGTC 18 
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22 



2) INFORMATION FOR SEQ ID NO: 2101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2609 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37Rv 

(C) ACCESSION NUMBER: Z79701 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2101 

CAGCCCGCGA GCGTAACCTG GCTGCGATTT CCGGCGCGGA TTTTCGCAGT 50 

GCGGTTACGC TCGGAAAGCG CGGGCCTCGC CCACGCGGCG GATGATGTCA 100 

GCGGGGTGGT CCTCGGCGAC GACCCGGACC ACGATCCACC CGTAGCGGTG 150 

CTGGACTTTC TCGTGCCGGA GGATGTCTTT CCGGTAGTGG TAGCGACTGG 200 

TCAGATGGTG GTCGCCGTCA TACTCGGCCG CGACCTTGAT GTCTTGCCAG 250 

CCCATATCCA AATGGGCTTC CGCCCAGCCC CATTCGTTGC GCACCGCGAT 300 

CTGCGTCTGG GGGCGCGGAA AGCCGGCGCG GATCAACAAC AAGCGCAGCC 3 50 

AGGTTTCCTT GGGGGACTGG GCACCGCCGT CGACGAGGTC CAGAGCGGCT 400 

CTTGCGGCCT TCATGCCACG GCGGCCCCGA TAGCGCTCGA TCAGCGGCTC 450 

GACGTCGGCC ACCTTCAAAT CGGTGGCCTG TATCAGGGCG TCGACGGCCG 500 

CGACGGCGGG GTCCAATGGA AATCGACTGG TCAGGTCGAG CGCCGTTCGC 550 

TCCGGTGTGG TCACGCGCAT GCCCTCGATG ACGCAGATCT CGTCGGGCTC 600 

GATGCGCTCT TCCCAGACTT GCAGCCCCGG GGCACGGCGG CGGTTGGTGT 650 

CGATGATCGC GGCGGGAAGA TCCGCGTCGA TCCACTTGGC GCCATGGAAG 70 0 

GCAGAAGCCG AGTAGCCGGC CAGCACGCCG CGGCGGCGCG AGCGCAGCCA 750 

CAGCGCTTTT GCACGCAATT GCGCGGTCAG TTCCACACCC TGCGGCACGT 800 
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ACACGTCTTT ATGTAGCGCG ACATACCTGC 
AATACACCCG CAGCCAGGGC CTCGCTGCCC 
CGAAGTGTGC TGAGTCACAC CGACAAACGT 
GAAAGTTCCC GCCGGAAATC GCAGCCACGT 
TTTCGGCCCG GCCGCGGCGA GACGATAGGT 
CCACTGAAGG GGCCAAACCC CCATTCGTAT 
GGAGGAAACC GGGGGATCGG GCTGGCGATC 
CGGCCACAAG GTGGCCGTCA CCCACCGTGG 
TGTTTGGCGT CGAATGTGAC GTCACCGACA 
TTCACGGCGG TAGAAGAGCA CCAGGGTCCG 
CGCCGGCCTA TCCGCGGACG CATTCCTCAT 
TCGAGAAGGT CATCAACGCC AACCTCACCG 
CGGGCATCGC GCAGCATGCA GCGCAACAAA 
AGGTTCGGTC TCCGGCAGCT GGGGGATCGG 
CCTCCAAGGC CGGAGTGATT GGCATGGCCC 
TCGAAGGCAA ACGTGACCGC GAATGTGGTG 
CGATATGACC CGCGCGCTGG ATGAGCGQAT 
TTATCCCAGC GAAGCGGGTC GGCACCCCCG 
AGCTTCCTGG CTTCCGAGGA TGCGAGCTAT 
GGTCGACGGC GGCATGGGTA TGGGCCACTG 
TGACAGGACT GCTGGACGGC AAACGGATTC 
GACTCGTCGA TCGCGTTTCA CATCGCACGG 
CCAGCTGGTG CTCACCGGGT TCGACCGGCT 
CCGACCGGCT GCCGGCAAAG GCCCCGCTGC 
GAGGAGCACC TGGCCAGCTT GGCCGGCCGG 
GGGCAACAAG CTCGACGGGG TGGTGCATTC 
CCGGGATGGG CATCAACCCG TTCTTCGACG 
AAGGGCATCC ACATCTCGGC GTATTCGTAT 
GCTGGCGATC ATGAACCCCG GAGGTTCCAT 
CGAGCCGGGC GATGCCGGCC TACAACTGGA 
TTGGAGTCGG TCAACAGGTT CGTGGCGCGC 
GCGTTCGAAT CTCGTTGCCG CAGGCCCTAT 
CGATCGTCGG CGGTGCGCTC GGCGAGGAGG 
CTCGAGGAGG GCTGGGATCA GCGCGCTCCG 
TGCGACGCCG GTCGCCAAGA CGGTGTGCGC 
CGGCGACCAC GGGTGACATC ATCTACGCCG 
TTGCTCTAG 
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TGCGCAATTC GTAGGGCGTC 8 50 

AGAAAGGGAT CCGTCATGGT 90 0 

CACGAGCGTA ACCCCAGTGC 950 

TACGCTCGTG GACATACCGA 1000 

TGTCGGGGTG ACTGCCACAG 1050 

CCCGTTCAGT CCTGGTTACC 1100 

GCACAGCGGC TGGCTGCCGA 1150 

ATCCGGAGCG CCAAAGGGGC 12 00 

GCGACGCCGT CGATCGCGCC 1250 

GTCGAGGTGC TGGTGTCCAA 13 00 

GCGGATGACC GAGGAAAAGT 1350 

GGGCGTTCCG GGTGGCTCAA 14 00 

TTCGGTCGAA TGATATTCAT 14 50 

CAACCAGGCC AACTACGCAG 1500 

GCTCGATCGC CCGCGAGCTG 1550 

GCCCCGGGCT ACATCGACAC 1600 

TCAGCAGGGG GCGCTGCAAT 1650 

CCGAGGTCGC CGGGGTGGTC 170 0 

ATCTCCGGTG CGGTCATCCC 1750 

ACACAACACA AGGACGCACA 18 00 

TGGTTAGCGG AATCATCACC 18 50 

GTAGCCCAGG AGCAGGGCGC 1900 

GCGGCTGATT CAGCGCATCA 1950 

TCGAACTCGA CGTGCAAAAC 2000 

GTGACCGAGG CGATCGGGGC 2 050 

GATTGGGTTC ATGCCGCAGA 2100 

CGCCCTACGC GGATGTGTCC 2150 

GCTTCGATGG CCAAGGCGCT 2200 

CGTCGGCATG GACTTCGACC 22 50 

TGACGGTCGC CAAGAGCGCG 23 00 

GAGGCCGGCA AGTACGGTGT 23 50 

CCGGACGCTG GCGATGAGTG 24 00 

CCGGCGCCCA GATCCAGCTG 2450 

ATCGGCTGGA ACATGAAGGA 2500 

GCTGCTGTCT GACTGGCTGC 2550 

ACGGCGGCGC GCACACCCAA 2600 

2609 



2) INFORMATION FOR SEQ ID NO: 2102 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2102 
CACCTTCACC CTGACCGACG 



2) INFORMATION FOR SEQ ID NO: 2103 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2103 
CGAACCAGCG GAAATAGTTG GAC 23 



2) INFORMATION FOR SEQ ID NO: 2104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2104 
CTGGGCATGG CICGAGTC 18 



2) INFORMATION FOR SEQ ID NO: 2105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37rv 

(C) ACCESSION NUMBER: U68480 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2105 

ATGACACAGT GCGCGAGCAG ACGCAAAAGC ACCCCAAATC GGGCGATTTT 50 

GGGGGCTTTT GCGTCTGCTC GCGGGACGCG CTGGGTGGCC ACCATCGCCG 100 

GGCTGATTGG CTTTGTGTTG TCGGTGGCGA CGCCGCTGCT GCCCGTCGTG 150 

CAGACCACCG CGATGCTCGA CTGGCCACAG CGGGGGCAAC TGGGCAGCGT 200 

GACCGCCCCG CTGATCTCGC TGACGCCGGT CGACTTTACC GCCACC6TGC 250 

CGTGCGACGT GGTGCGCGCC ATGCCACCCG CGGGCGGGGT GGTGCTGGGC 300 

ACCGCACCCA AGCAAGGCAA GGACGCCAAT TTGCAGGCGT TGTTCGTCGT 350 

CGTCAGCGCC CAGCGCGTGG ACGTCACCGA CCGCAACGTG GTGATCTTGT 400 

CCGTGCCGCG CGAGCAGGTG ACGTCCCCGC AGTGTCAACG CATCGAGGTC 450 
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ACCTCTACCC ACGCCGGCAC CTTCGCCAAC TTCGTCGGGC TCAAGGACCC 500 
GTCGGGCGCG CCGCTGCGCA GCGGCTTCCC CGACCCCAAC CTGCGCCCGC 55 0 

AGATTGTCGG GGTGTTCACC GACCTGACCG GGCCCGCGCC GCCCGGGCTG 600 
GCGGTCTCGG CGACCATCGA CACCCGGTTC TCCACCCGGC CGACCACGCT 650 
GAAACTGCTG GCGATCATCG GGGCGATCGT GGCCACCGTC GTCGCACTGA 7 00 

TCGCGTTGTG GCGCCTGGAC CAGTTGGACG GGCGGGGCTC AATTGCCCAG 750 
CTCCTCCTCA GGCCGTTCCG GCCTGCATCG TCGCCGGGCG GCATGCGCCG 800 
GCTGATTCCG GCAAGCTGGC GCACCTTCAC CCTGACCGAC GCCGTGGTGA 8 50 

TATTCGGCTT CCTGCTCTGG CATGTCATCG GCGCGAATTC GTCGGACGAC 900 
GGCTACATCC TGGGCATGGC CCGAGTCGCC GACCACGCCG GCTACATGTC 950 

CAACTATTTC CGCTGGTTCG GCAGCCCGGA GGATCCCTTC GGCTGGTATT 100 0 

ACAACCTGCT GGCGCTGATG ACCCATGTCA GCGACGCCAG TCTGTGGATG 1050 

CGCCTGCCAG ACCTGGCCGC CGGGCTAGTG TGCTGGCTGC TGCTGTCGCG 1100 

TGAGGTGCTG CCCCGCCTCG GGCCGGCGGT GGAGGCCAGC AAACCCGCCT 1150 

ACTGGGCGGC GGCCATGGTC TTGCTGACCG CGTGGATGCC GTTCAACAAC 12 0 0 

GGCCTGCGGC CGGAGGGCAT CATCGCGCTC GGCTCGCTGG TCACCTATGT 12 50 

GCTGATCGAG CGGTCCATGC GGTACAGCCG GCTCACACCG GCGGCGCTGG 13 0 0 

CCGTCGTTAC CGCCGCATTC ACACTGGGTG TGCAGCCCAC CGGCCTGATC 13 50 

GCGGTGGCCG CGCTGGTGGC CGGCGGCCGC CCGATGCTGC GGATCTTGGT 14 0 0 

GCGCCGYCAT CGCCTGGTCG GCACGTTGCC GTTGGTGTCG CCGATGCTGG 1450 

CCGCCGGCAC CGTCATCCTG ACCGTGGTGT TCGCCGACCA GACCCTGTCA 1500 

ACGGTGTTGG AAGCCACCAG GGTTCGCGCC AAAATCGGGC CGAGCCAGGC 1550 

GTGGTATACC GAGAACCTGC GTTACTACTA CGTCATCCTG CCCACCGTCG 1600 

ACGGTTCGCT GTCGCGGCGC TTCGGCTTTT TGATCACCGC GCTATGCCTG 1650 
TTCACCGCGG TGTTCATCAT GTTGCGGCGC AAGCGAATTC CCAGCGTGGC . 1700 

CCGCGGACCG GCGTGGCGGC TGATGGGCGT CATCTTCGGC ACCATGTTCT 1750 

TCCTGATGTT CACGCCCACC AAGTGGGTGC ACCACTTCGG GCTGTTCGCC 1800 

GCCGTAGGGG CGGCGATGGC CGCGCTGACG ACGGTGTTGG TATCCCCATC 1850 

GGTGCTGCGC TGGTCGCGCA ACCGGATGGC GTTCCTGGCG GCGTTATTCT 1900 

TCCTGCTGGC GTTGTGTTGG GCCACCACCA ACGGCTGGTG GTATGTCTCC 1950 

AGCTACGGTG TGCCGTTCAA CAGCGCGATG CCGAAGATCG ACGGGATCAC 2 000 

AGTCAGCACA ATCTTTTTCG CCCTGTTTGC GATCGCCGCC GGCTATGCGG 2050 

CCTGGCTGCA CTTCGCGCCC CGCGGCGCCG GCGAAGGGCG GCTGATCCGC 2100 

GCGCTGACGA CAGCCCCGGT ACCGATCGTG GCCGGTTTCA TGGCGGCGGT 2150 

GTTCGTCGCG TCCATGGTGG CCGGGATCGT GCGACAGTAC CCGACCTACT 22 00 

CCAACGGCTG GTCCAACGTG CGGGCGTTTG TCGGCGGCTG CGGACTGGCC 2250 

GACGACGTAC TCGTCGAGCC TGATACCAAT GCGGGTTTCA TGAAGCCGCT 23 00 

GGACGGCGAT TCGGGTTCTT GGGGCCCCTT GGGCCCGCTG GGTGGAGTCA 23 50 

ACCCGGTCGG CTTCACGCCC AACGGCGTAC CGGAACACAC GGTGGCCGAG 24 00 

GCGATCGTGA TGAAACCCAA CCAGCCCGGC ACCGACTACG ACTGGGATGC 24 50 

GCCGACCAAG CTGACGAGTC CTGGCATCAA TGGTTCTACG GTGCCGCTGC 2 500 

CCTATGGGCT CGATCCCGCC CGGGTACCGT TGGCAGGCAC CTACACCACC 2550 

GGCGCACAGC AACAGAGCAC ACTCGTCTCG GCGTGGTATC TCCTGCCTAA 26 00 

GCCGGACGAC GGGCATCCGC TGGTCGTGGT GACCGCCGCG GGCAAGATCG 2650 

CCGGCAACAG CGTGCTGCAC GGGTACACCC CCGGGCAGAC TGTGGTGCTC 2 700 

GAATACGCCA TGCCGGGACC CGGAGCGCTG GTACCCGCCG GGCGGATGGT 2750 

GCCCGACGAC CTATACGGAG AGCAGCCCAA GGCGTGGCGC AACCTGCGCT 2800 

TCGCCCGAGC AAAGATGCCC GCCGATGCCG TCGCGGTCCG GGTGGTGGCC 2850 

GAGGATCTGT CGCTGACACC GGAGGACTGG ATCGCGGTGA CCCCGCCGCG 2900 

GGTACCGGAC CTGCGCTCAC TGCAGGAATA TGTGGGCTCG ACGCAGCCGG 2950 

TGCTGCTGGA CTGGGCGGTC GGTTTGGCCT TCCCGTGCCA GCAGCCGATG 3 00 0 

CTGCACGCCA ATGGCATCGC CGAAATCCCG AAGTTCCGCA TCACACCGGA 3 050 

CTACTCGGCT AAG7AGCTGG ACACCGACAC GTGGGAAGAC GGCACTAACG 3100 

GCGGCCTGCT CGGGATCACC GACCTGTTGC TGCGGGCCCA CGTCATGGCC 3150 

ACCTACCTGT CCCGCGACTG GGCCCGCGAT TGGGGTTCCC TGCGCAAGTT 3200 

CGACACCCTG GTCGATGCCC CTCCCGCCCA GCTCGAGTTG GGCACCGCGA 3250 

CCCGCAGCGG CCTGTGGTCA CCGGGCAAGA TCCGAATTGG TCCATAG 3297 
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2) INFORMATION FOR SEQ ID NO: 2106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2106 

GCGAGCAGAG CACGCCCTCC TCGCCGCTCG C 31 
2) INFORMATION FOR SEQ ID NO: 2107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2107 
GCGAGCTCCC CATCTCTGGT TGGCACGCTC GC 32 



2) INFORMATION FOR SEQ ID NO: 2108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2108 
GCGGGCAACT TCRTCAAGAA GGTTGGTTAC AACCCGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2109 
GCGGGCCCAA TCTCTGGTTG GAAYGGTGAC AAGCCCGC 38 



2) INFORMATION FOR SEQ ID NO: 2110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2110 
GCGGGCCCTT AACGATTTCA GCGAATCTGG ATTCAGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2111 
GCGGGCATGA TTGAAGCCAC CACCAACGCT TCCTGGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2112 
GCGGGCTTGA TGAAGTTTTG GGTTTCCTTG ACAATTTGCC CGC 4 3 



2) INFORMATION FOR SEQ ID NO: 2113 
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(i) SEQUENCE CHAEy^CTERISTICS : 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2113 
GCGGGCACAA GGGTTGGACT AAGGAAACCA AGGCAGCCCG C 



2) INFORMATION FOR SEQ ID NO: 2114 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2114 
GCGGGCATCG ATGCTATTGA ACCACCTGTC AGACCGCCCG C 
2) INFORMATION FOR SEQ ID NO: 2115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2115 
GCGGGCTTGA TGATTTCCTC GAATCTAGAT TGGGCCCGC 



2) INFORMATION FOR SEQ ID NO: 2116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2116 

GCGGGCGGTA AGTCCACCGG TAAGACCTTG TTGGCCCGC 
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